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Abstract 

The  Programmable  Strat°gy  Theorem  Prover  (PSTP)  is  a theorem  proving 
program  for  the  first  order  predicate  calculus  using  the  linear  HESON 
procedure  as  the  inference  system.  The  linear  MESON  procedure  is  a new 
variant  of  the  model  elimination  theorem  proving  procedure  for  theories 
with  or  uithout  equality  which  is  an  affirmation  rather  than  a refutation 
procedure.  It  is  profitably  viewed  as  an  extension  of  the  problem- 
reduction  method.  The  fundamental  element  of  a linear  HESON  procedure 
deduction,  the  chain,  is  a representation  of  a set  of  goals  to  be  solved 
and  their  supergoals.  PSTP  is  designed  to  be  used  interactively  or  in  a 
fully  automatic  mode.  Some  features  of  PSTP  are  a general  mechanism  for 
spec  i tying  which  chains  are  to  be  retained  and  manipulated,  an  automatic 
procedure  for  storinq  and  retrieving  information  about  chains  when  this 
information  is  requested,  the  capability  of  specifying  an  ordering  fi.net  ion 
which  can  be  used  for  specifying  search  strategies,  and  a powerful  set  of 
commands.  Results  of  an  experiment  testing  some  "itnple  search  strategieo 
and  comparisons  with  results  from  other  theorem  proving  studies  are 
presented. 
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8.  Introduction 


The  first  section  of  this  report  describes  a new  theorem  proving 
procedure  for  the  first  order  predicate  calculus.  the  linear  MESON 
procedure,  a variant  of  model  elimination.  Although  *he  linear  MESON 
procedure  is  not  substantially  mere  powerful  than  related  -eso I ut i on 
procedures  except  in  the  capab  lity  for  restriction  or  differential 
treatment  of  inferences  by  particular  implicative  forms  cf  the  axioms,  it 
represents  a significant  increase  in  the  "naturalness"  of  proofs  by 
complete  inference  systems  because  it  can  dp  viewed  as  an  extension  of  the; 
prob  I em-reduct i on  method  (31. 

In  addition  to  the  standard  'subgoal ing"  mechanism  of  the  problem- 
reduction  method  (by  which  a goal  is  replaced  by  a set  of  subyoa I s whose 
solution  constitutes  a solution  to  the  goal)  represented  in  the  linear 
MESON  procedure  by  tne  extension  operation,  the  linear  MESON  procedure 
provides  a mechanism  for  soiving  a goal  one  ot  whose  subgoals  is  its 
logical  negation  requiring  reasoning  by  contradiction  (the  reduction 
oper  a t i on  i , an  oper  a t i on  for  eliminating  dup I i cate  goa Is  ( f ac  tor i za  t i on)  , 
and  operations  for  solving  problems  involving  the  equality  relation  (p- 
extens i on,  p-r eduction)  without  requiring  full  ax i oma t i za t i on  of  its 
properties.  Tnese  addeo  inference  operations  result  in  a complete 
inference  system  for  the  first  order  predicate  calculus  with  equality. 

The  second  section  describes  the  underlying  concepts  and  overall 
design  of  the  Programmable  Strategy  Theorem  Prover  (PSTP) , a theorem 
proving  program  employing  the  linear  MESON  procedure  as  its  inference 
system  and  designed  to  be  used  interactively  or  fully  automatically 
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Although  restricted  to  using  the  linear  MESON  procedure  as  its  inference 
system,  PSTP  provides  substantial  flexibility  in  specification  of  search 
strategies,  both  in  terms  of  deletion  criteria  (such  as  use  of  length, 
levei,  and  depth  bounds)  and  of  the  order  in  uhich  inference  operations  are 
to  be  performea  (such  as  depth  first,  breadth  first,  or  diagonal  search 
strategi es) . 

The  third  section  presents  the  resul  ls  of  a performance  study  on  a set 
of  problems  previously  tested  in  two  other  theorem  proving  performance 
studies.  Some  conclusions  are  drawn  concerning  relative  merits  of  tested 
search  strategies  on  empirical  and  philosophical  grounds  and  the  power  of 
the  linear  MESON  procedure  relative  to  other  procedures  as  exemplified  by 
results  from  the  other  studies. 
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1.  The  Linear  flESON  Procedure 


The  linear  MESON  procedure  is  a variant  of  tne  model  elimination 
theorem  proving  procedure  [6,7,8]  in  uhich  (1)  each  literal  of  the  top 
chain  and  the  derived  chains  of  a deduction  is  replaced  by  its  complement 
and  t<_)  implications  as  well  as  disjunctions  are  permitted  as  axioms.  It 
is  also  the  linear  form  of  the  MESON  procedure  [9)  specified  for  goa  I - 
suogoa  i trees  (MESON  stands  for  "model  elimination  subgoal  oriented"). 
Advantages  of  the  linear  MESON  procedure  are  (1)  the  linear  MESON 
pro  edure,  ihough  logically  equivalent  to  model  elimination,  has  the  form 
of  an  affirmation  rather  than  a refutation  procedure  and  its  proofs  have  a 
very  natural  interpretation  in  terms  of  goal-subgoal  trees  and  (2)  the 
linear  MESON  procedure  has  greater  expressive  power  than  a'ternative 
procedures  in  the  potential  use  of  implications  rather  than  disjunctions  to 
restrict  application  of  inference  operations  to  certain  literals  of  axioms 
or  f by  replacing  a disjunction  by  more  than  ore  implication)  to  facilitate 
differential  treatment  of  the  various  implicative  forms  of  axioms  during 
the  search  for  a solution  (this  capability  is  shared  by  model  elimination 
in  which  each  .ength  n input  clause  generates  n auxiliary  chains  only  the 
last  literal  of  which  can  be  matched  in  inference  operations). 

In  a theorem  proving  program  permitting  interaction  between  human  user 
and  mechanical  proof  procedure,  it  is  desirable  that  as  human-oriented  a 
procedure  as  possible  be  employed.  While  it  is  correctly  argued  that  all 
resolution  type  theorem  proving  procedures  are  machine-oriented  and 
notoriously  unsuited  to  extensive  human  computation,  it  is  our  contention 
that  the  linear  MESON  procedure  1 s more  human-oriented  than  other 
resolution  related  procedures. 


This  is  a direct  consequence  of  the  relationship  of  the  linear  MESON 
procedure  with  the  problem-reduction  method.  The  linear  MESON  procedure  is 
an  extension  of  the  problem-reduction  method  which  is  complete  for  the 
first  order  predicate  calculus  with  equality.  It  augments  the  problem- 
reduction  method  by  inference  operations  which  perform  reasoning  by 
contradiction  (reduction),  which  eliminate  duplicate  subgoals 
(factorization),  and  which  deal  with  the  equality  relation  (p-extens i on,  p- 
reduc  t i on) . 

The  linear  MESON  procedure  represents  the  state  of  a search  for 
solution  a9  a 9et  of  chains.  Each  chain  represents  a subtree  of  the  search 
space.  The  solution  of  all  the  subgoals  represented  in  a chain  constitutes 
a solution  of  the  top  goal.  Different  chains  represent  different 
alternative  partial  solutions  of  the  top  goal. 

More  specific  claims  of  the  linear  MESON  procedure  being  human- 
oriented  are  (1)  it  is  an  affirmation  rather  than  a refutation  procedure, 
(2)  in  keeping  literals  ordered  in  a chain,  it  automatically  prevents  (in 
goal-subgoal  tree  terms)  the  start  of  an  attempt  to  solve  another  subgoal 
in  the  chain  until  the  current  one  is  solved,  and  (3)  it  is  a procedure 
which  remains  complete  if  only  input  deductions  are  used.  An  input 
deduction  is  a deduction  in  which  each  element  of  the  deduction  (a  linear 
MESON  procedure  chain)  is  derived  by  an  inference  operation  applied  to  its 
predecessor  or  its  predecessor  and  an  (input)  axiom. 

In  combination,  the  last  two  items  permit  the  user  to  focus  his 
attention  on  a much  smaller  subset  of  the  available  data  than  is  possible 
for  many  resolution  based  procedures. 

In  preparing  a problem  fir  .nput  to  the  linear  MESON  procedure,  the 


input  formula  is  first  converted  to  prenex  form  with  its  matrix  in  the  form 
of  a conjunction  of  assertions  implying  a cone  I us  i on.  An  assertion  is  a 
(possibly  empty)  conjunction  of  literals  ( jntecedents)  implying  a 

disjunction  of  literals  (consequents) . Note  that  if  there  are  no 
antecedents  in  the  assertion,  the  assertion  is  just  a disjunction  of 
literals  (d | ? jync  ts) , i.e.,  a clause.  A conclusion  is  a conjunction  of 
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literals.  Schematically,  the  transformed  formula  is  in  the  form 


u 1 (A,  A.  . . nA^C,  V.  . . vC„)  A ...  A (A^A.  . . A,V^C‘,V.  . .vC^J  -♦  (G . A.  . . aG„)  ) 

' \ V * % 

where  A,  C and  G denote  literals,  Q denotes  a list  of  quantifiers,  and  p>l , 
q>l,  each  m(>3,  each  nt>l. 

The  transformed  formula  is  then  Skolemized  by  (1)  replacing  for  each 
universal  quantifier  in  the  prefix  all  occurrences  of  the  quantified 
variable  in  the  matrix  by  a unique  Skolem  function  with  all  the 
existentially  quantified  variables  whose  quantifiers  precede  the  universal 
quantifier  in  the  prefix  as  arguments  and  (2)  deleting  the  quantifier 
prefix. 

An  alternate  input  form  for  the  linear  NESQN  procedure  is  a 
conjunction  of  quantified  assertions  implying  a conclusion.  Schematically, 
Q i (A.  a.  , . aA^j+C,  v. . ,vC„^)  a ...  a Q p(A,  a.  . . aA^C,  v. . . vCn)  •*  QpJG,  a.  . . Aj^) 
where  A,  C and  G denote  literals,  Q denotes  a list  of  quantifiers,  and  p>l, 


q>l,  each  m >0,  each  n >1. 

I l 

This  input  form  is  Skolemized  by  (1)  replacing  in  each  assertion  al  i 
occurrences  of  each  existentially  quantified  variable  by  a unique  Skolem 
function  whose  arguments  are  the  variables  of  universal  quantifiers 
preceding  the  existential  quantifier,  (2)  replacing  in  the  conclusion  all 
occurrences  of  each  universally  quantified  variable  by  a unique  Skolem 
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function  whose  arguments  are  the  variables  of  existential  quantifiers 
preceding  the  universal  quantifier,  and  (3)  removing  all  quantifiers  from 
the  formula. 

Ex  amp  I e.  The  problem  is:  if  a is  a prime  number  and  a times  the 
square  of  some  number  u is  b then  a divides  b.  The  initial  formula  is: 

VxVyVzVw  (prime(x)  a y*z=w  a divides(x,w) 

-*  divides(x.y)  v di  vides(x.z) ) 
a Vx  x-,Vx<=square  (x) 
a VxVyVz  (x*y=z  -*  y*x=z) 
a VxVyVz  (x*y=z  -»  d i v i des  (x,  z) ) 
a 3u  a^square (u) =square (b) 
a pr i me (a) 

-*  divides(a.b) 

The  Skolemized  form  ready  for  input  to  the  linear  MESON  procedure  is: 

(prime(x)  a y-,vz-w  a h i v i des  (x,w)  -♦  divides(x.y)  v di  v i des  (x,  z) ) 
a x*x=square (x) 
a (y* y«z  -*  y*x=z) 
a (x*y  = z •*  divide s(x,z)) 
a a-.vsquare  (c)  “Square  (b) 
a pr i me  (a) 

-♦  divides(a.b) 

Problems  with  equality  (as  above)  can  be  introduced  without  the  need 
for  specifying  the  symmetry,  transitivity,  and  subst i tut i v i ty  axioms  if  the 
special  equality  inference  operations  (p-extensi on,  p-reduction)  are  used. 
The  equality  reflexive  (x=x)  and  functionally  reflexive  axioms  (e.g., 
square  (x)  ^square  ( x ) , x*y*x»vy)  theoretically  are  required.  The  latter  are 
not  present  in  the  above  example  since  no  special  equality  rules  are 
required  for  the  problem’s  solution.  (This  is  the  same  as  the  NUfll  example 
studied  in  Section  3.) 

By  virtue  of  its  derivation  from  model  elimination,  the  linear  MESON 
procedure  is  complete  (given  that  a compatible  set  of  inference  and 
postprocessing  operations  are  used)  provided  (1)  the  set  of  assertions  is 
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consistent  (this  requirement  is  equivalent  to  the  requirement  that  the  too 
chain  of  a model  elimination  Deduction  be  in  the  minimally  unsat  i sf  i ab  I e 
set  of  input  clauses)  ana  (2)  either  the  disjunctive  axiom  form  is  used  or 

a,i  implicative  forms  of  each  assertion  are  included  among  the  assertions. 

The  first  condition  can  be  eliminated  by  the  addition  of  a special 

contradiction  mechanism  defined  for  the  MESON  procedure  but  not  included  in 
this  formulation  of  the  linear  MESON  procedure  which  permits  the  proof  of 
any  conclusion  from  an  inconsistent  set  of  assertions.  The  second 

condition  can  sometimes  be  eliminated  in  practice  since  it  is  often  clear 
from  the  problem  structure  (as  in  the  case  of  Horn  formulas)  that  use  of  a 
subset  of  the  i nip  iicative  forms  r °su I t s in  the  possible  deduc  t i on  of  all 
the  chains  that  the  disjunctive  form  would.  Also,  although  the  resulting 
procedure  is,  in  general,  incomplete  it  is  sometimes  desirable  to  restrict 
the  search  for  a proof  by  not  presenting  the  procedure  with  a,  I the 
mplicative  forms  of  the  assertions. 

The  fundamental  element  of  a linear  MESON  procedure  deduction  is  the 
Qhain.  A chain  is  an  ordered  sequence  of  literals.  Two  types  of  literals 
are  distinguished:  A- ' i ter  a I s and  B- i i ter a I s . B- literals  correspond  to  the 
literals  present  in  clauses  in  resolution  theorem  proving.  A— I iterals 
record  ancestry  information  and  represent  (in  goal-subgoal  tree  terms) 
higher  goals.  All  the  literals  in  the  theorem  and  axioms  are  B- I iterals. 
An  A- literal  is  created  in  a newly  derived  chain  from  a B- literal  in  the 
parent  chain  when  a set  of  literals  whose  conjunction  implies  the  A- literal 
(a  set  of  subgoals  whose  solution  constitutes  a solution  to  the  goal 
represented  by  the  A-literal)  is  added. 

A linear  MESON  procedure  deduction  of  chain  K from  problem  P is  a 
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sequence  K0 K„  of  acceptable  chains  where  <0  is  the  conclusion  of  P,  <h 

is  K,  each  K,  ( 1 < i <n ) is  derived  from  K,_,  by  extension  (by  an  assertion 
C.(),  factorization,  reduction,  p-extension  (by  an  assertion  Q.,)  or  p- 
reduction,  and  each  C, is  an  assertion  of  P or  a lemma  (see  lemma  formation 
operation  below).  A solution  of  P (a  proof  of  the  conclusion  of  P)  is  a 
linear  MESON  procedure  deduction  of  the  empty  chain  from  P.  In  general, 
for  the  inference  system  to  be  complete,  the  negation  of  the  conclusion 
must  be  included  among  the  assertions.  The  definitions  of  acceptable 
chains  and  the  inference  operations  are  given  below.  Example  proofs 

illustrating  most  forms  of  the  inference  operations  appear  starting  on  page 

17. 

Batching.  If  the  two  arguments  to  the  matching  procedure  are  terms, 
the  matching  procedure  returns  the  most  general  unifier  of  the  terms.  If 
the  two  terms  are  not  unifiable,  the  matching  procedure  fails. 

If  the  two  arguments  to  the  matching  procedure  are  literals  and  are 
both  positive  (unnegated)  or  both  negative  (negated)  literals  with 
unifiable  atomic  formulae,  the  matching  procedure  returns  the  most  general 
unifier  of  the  atomic  formulae.  If  the  two  arguments  are  not  both  positive 

or  both  negative  or  the  atomic  formulae  are  not  unifiable,  the  matching 
procedure  fails. 

E*teng[pn.  The  extension  operation  takes  an  acceptable  chain  K and  an 
implication  (alt.,  disjunction)  C as  its  arguments.  Let  K’  and  C’  be 
variable  disjoint  variants  of  K and  C.  If  the  last  literal  of  K’  matches  a 
consequent  (alt.,  disjunct)  of  C’,  the  chain  consisting  of  K’  followed  by 
the  antecedents  and  the  complements  of  the  remaining  consequents  (alt.,  the 
complements  of  the  remaining  disjuncts)  of  C’  with  matching  substitution 
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applied  can  be  inferred.  Each  literal  of  the  derived  chain  descended  from 
a literal  of  K'  is  designated  to  be  the  same  type  of  literal  as  its 
ancestor  except  the  last  which  is  designated  to  be  an  A- literal;  each 
liieral  of  the  derived  chain  descended  from  a literal  of  C*  is  designated 
to  be  a B- I i tera I . 

Factor  i zat  i on.  The  factorization  operation  takes  an  acceptable  chain 
K as  its  argument.  If  the  last  literal  of  K matches  a preceding  B-literal 
of  K,  the  chain  consisting  of  K with  the  last  literal  removed  and  with 
matching  substitution  applied  can  be  inferred.  Each  literal  of  the  derived 
chain  is  designated  to  be  the  same  type  as  its  ancestor. 

Reduct i on.  The  reduction  operation  takes  an  acceptable  chain  K as  its 
argument.  If  the  last  literal  of  K matches  the  complement  of  a preceding 
A—  I i ter  a I of  K,  the  chain  consisting  of  K with  the  last  literal  removed  and 
with  matching  substitution  applied  can  be  inferred.  Each  literal  of  the 
derived  chain  is  designated  to  be  the  same  type  as  its  ancestor. 

P-extens  i on.  The  p-extension  ("p-"  for  paramodu  I at  i on)  operation 
takes  an  acceptable  chain  K and  an  implication  (alt.,  disjunction)  C as  its 
arguments.  Let  K’  and  C’  be  variable  disjoint  variants  of  K and  C.  (a)  If 
a consequent  (alt.,  disjunct)  of  C’  is  of  the  form  a=b  or  b=a  where  a 
matches  a term  in  the  last  literal  of  K’,  the  chain  consisting  of  K’ 
followed  by  the  antecedents  and  the  complements  of  the  remaining 
consequents  (alt.,  the  complements  of  the  remaining  disjuncts)  of  C* 
followed  by  a copy  of  the  last  literal  of  K’  with  a single  instance  of  the 
term  matching  a replaced  by  b with  matching  substitution  applied  can  be 
inferred.  Each  literal  of  the  derived  chain  descended  from  a literal  of  K* 
is  designated  to  be  the  same  type  of  literal  as  its  ancestor  except  the 
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last  which  is  designated  to  be  an  A- literal;  each  literal  of  the  derived 
cha.n  descended  from  a litera'  of  C’  is  designated  to  be  a B-literal;  the 
last  literal  of  the  derived  chain  (in  which  an  instance  of  a term  matching 
a was  replaced  by  b)  is  designated  to  be  a B-literal.  This  form  of  p- 
extens  i on  is  called  p-extension  from  an  assertion.  (b)  If  the  last  literal 
of  K is  of  the  form  a*b  or  b*a  where  a matches  a term  in  a consequent 
(alt.,  disjunct)  of  C’,  the  chain  consisting  of  K’  followed  by  the 
antecedents  and  the  complements  of  the  remaining  consequents  (alt.,  the 
complements  of  the  remaining  disjuncts)  of  C*  followed  by  a copy  of  the 
complement  of  the  consequent  (alt.,  disjunct)  containing  the  term  matching 
a with  a single  instance  of  that  term  replaced  by  b with  matching 
substitution  applied  can  be  inferred.  Each  literal  o*  the  derived  chain 
descended  from  a literal  of  K’  is  designated  to  be  the  same  type  of  literal 
as  its  ancestor  except  the  last  which  is  designateu  to  be  an  A-literal; 
each  literal  of  the  derived  chain  descended  from  a literal  of  C’  is 
designated  to  be  a B-literal;  the  last  literal  of  the  derived  chain  (in 
which  an  instance  of  a term  matching  a was  replaced  by  b)  is  designated  to 
be  a B-literal.  This  form  of  p-extensior.  is  called  p-extension  t£  an 
assert  ion. 

P-reducLi  on.  The  p-reduction  ("p-"  for  paramodu lat i on)  operation 
takes  an  acceptable  chain  K as  its  argument.  (a)  If  the  last  literal  of  K 
contains  a term  matching  the  term  a where  a preceding  A-literal  of  K \z  of 
the  form  a*b  or  b*a,  the  chain  consisting  of  K followed  by  a copy  of  the 
last  literal  with  a single  instance  of  the  term  matching  a replaced  by  b 
with  matching  substitution  applied  can  be  inferred.  Each  literal  of  the 
derived  chain  descended  from  a literal  of  K is  designated  to  be  the  same 
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type  of  literal  as  its  ancestor  except  the  last  uhich  is  designated  to  be 
an  A-hteral;  the  last  literal  of  the  derived  chain  (in  uhich  an  instance 
of  a term  matching  a uas  replaced  by  b)  is  designated  to  be  a B-hteral. 
This  form  of  p-reduction  is  called  p-reduction  from  an  A—  I i ter  a I . (b)  If 
the  last  literal  of  K is  of  the  form  a*b  or  b*a  uhere  a matches  a term  in  a 
preceding  A-literal  of  K or  the  last  literal  of  K itself,  the  chain 
consisting  of  K folioued  by  a copy  of  the  preceding  A-literal  or  last 
Mteral  uith  a '-ingle  instance  of  the  term  matching  a replaced  by  b uith 
matching  suDstitution  applied  can  be  inferred.  Each  literal  of  the  derived 
chain  descended  from  a literal  of  K is  designated  to  be  the  same  type  of 
literal  as  its  ancestor  except  the  last  uhich  is  designated  to  be  an  A- 
literal;  the  last  literal  of  the  derived  chain  (in  uhich  an  instance  of  a 
term  matching  a uas  replaced  by  b)  is  designated  to  be  a B- literal.  This 
form  of  p-reduction  is  called  p-reduction  t an  A-literal  or  self. 

If  the  parent  chain  K n the  p-extension  or  p-reduction  operation  is 
itself  derived  by  p-extension  or  p-reduction,  the  created  A-literal  in  the 
derived  chain  may  optionally  be  omitted  uith  completeness  unaffected  (see 
(81).  There  is  a tradeoff  here.  If  the  A-literal  is  omitted,  the  derived 
chain  19  shorter  and  easier  to  read,  and  some  future  possible  reductions 
and  p-reductions  may  be  eliminated.  On  the  other  hand,  especially  if  the 
postprocessing  operation  specifies  rejection  of  chains  containing  an  A- 
Mteral  folloued  by  an  identical  A-literal  or  B-litera!,  retention  of  the 
A-literal  miy  result  in  rejection  of  more  chains  as  being  unacceptable. 
For  example,  this  could  prevent  i epeated  p-exlension  by  a=b  from  creating 
an  endless  sequence  of  chains  ending  alternately  in  Pa  or  Pb. 

EfiStprpcessinq.  A postprocessing  operation  takes  a chain  K (output 
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from  the  extension,  factorization,  reduction,  p-extension  or  p-reduction 
open  tion)  as  its  argument  and  either  rejects  K as  being  non-acceptab  I e and 
thus  unusable  as  input  to  any  inference  operation  or  transforms  the  chain 
into  an  acceptable  chain.  Many  different  postprocessing  opeiations  can  be 
written  with  different  effects  regarding  efficiency  and  completeness.  Four 
postprocessing  operations  are  described  in  the  following  table.  The  table 
expresses  possible  relationships  between  each  pair  of  literals  in  the 
chain.  Ail  the  actions  corresponding  to  true  conditions  are  to  be 
performed  on  the  chain,  except,  of  course,  that  if  the  action  is  to  reject 

the  chain  then  no  other  conditions  need  be  checked  or  actions  need  be 
performed. 


Postprocessing  operations 


strong- 

save 

STRCNG- 

DELETE 

UEAK- 

SAVE 

WEAk- 

DELETE 

Cond i t i cn 

Action 

Action 

Action 

A- tion 

A-l  i teral  fol lowed  by 
identical  A-literal 

reject 
cha  i n 

reject 
cha  i n 

reject 
chai  n 

re  ject 
chain 

A-l i tera 1 fol lowed  by 
complementary  A-literal 

reject 
cha  i n 

reject 
cha  i n 

reject 

chain 

re  ject 
cha  i n 

A-l  i teral  fol lowed  by 
identical  B-l i teral 

re  ject 
chain 

reject 
cha  i n 

reject 

chain 

reject 
cha  i n 

A-l i teral  fol lowed  by 
complementary  B-l i teral 

de 1 ete 
fol 1 ow i ng 
B-l i teral 

de 1 ete 
f o 1 1 owi ng 
B- 1 i tera 1 

B- 1 i tera 1 foil  owed  by 
identical  A-l i teral 

re  ject 
chain 

reject 

chain 

B-l i teral  fol lowed  by 
complementary  A-literal 

re  ject 
chain 

reject 

chain 

B-l i teral  fol lowed  by 
identical  B-l i teral 

delete 
fol lowing 
B-l i teral 

delete 
fol  low i IT" 
B- 1 i tera 1 

B-l i teral  f o I lowed  by  reject 
complementary  B-l i teral  chain 
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reject 
cha  i n 


If  ail  the  action'  of  the  postprocessing  operation  specified  in  the 
table  have  been  performed  and  the  chain  is  not  rejected,  all  terminal  A- 
literals  of  the  chain  are  deleted.  This  terminal  A- literal  deletion  is 
called  con  tract i on. 

The  deletion  action  associated  with  the  "B-literal  followed  by 
identical  B-literal"  condition  is  called  around  factor i zat i on  since  it 
represents  an  instantiation-free  usage  of  a generalized  factorization 
operation  which  can  delete  non-terminal  B-literals.  Similarly,  the 
deletion  action  associated  with  the  "A-literal  followed  by  complementary  B- 
I i ter  a I condition  is  called  ground  reduct i on  since  it  represents  an 
instantiation-free  usage  of  a generalized  reduction  operation  which  can 
delete  non-terminal  B-literals. 

Several  additional  conditions  and  actions  can  be  used  in 
postprocessing  operations  such  as  the  following  which  are  available  in  PSTP 
but  were  not  used  in  the  present  study:  (1)  rejecting  chains  containing  a 
non-terminal  A-literal  that  is  an  instance  of  a unit  axiom  (single  literal 
input  assertion),  (2)  rejecting  chains  containing  a non-terminal  A-literal 
that  is  an  instance  of  a unit  lemma  (single  liberal  derived  assertion,  see 
lemma  formation  operation  below)  by  which  the  ancr^tor  chain  could  have 
been  extended  wien  the  A-literal  was  created,  (3)  deletion  of  B-literals 
which  are  instances  of  unit  axioms,  (4)  deletion  of  B-literals  which  are 
instances  of  unit  lemmas,  (5)  removal  of  all  literals  including  and 
following  the  A-literal  in  the  case  of  a B-literal  followed  by  an  identical 
A- 1 i ter  a i (this  is  called  factorization  truncation) . (6)  removal  of  all 
literals  including  and  following  the  second  A-literal  in  the  case  of  an  A- 
literal  followed  by  a complementary  A-literal  (this  is  called  reduction 
i pn) . A I I of  these  actions  can  be  shown  to  preserve  completeness. 
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For  problems  not  involving  the  equality  relation,  the  UEAK-SAVE  and 
wEAK-HtLETE  postprocessing  operations  yield  a complete  inference  system 
when  the  extension  and  reduction  operations  are  used.  The  WEAK 
postprocessing  operations  correspond  closely  to  the  chain  adm i ssab i I i ty 
criteria  for  model  elimination  of  [6:.  Also  for  problems  not  involving  the 
equality  relation,  the  STRONG-SAVE  and  STRONG-DELETE  postprocessing 
operations  yieid  a complete  inference  system  provided  the  factorization 
operation  is  used  in  audition  to  extension  and  reduction.  The  STRONG 
postprocessing  operations  correspond  closely  to  the  chain  acceptability 
criteria  for  strong  model  elimination  of  [7].  Note  that  with  these 
postprocessing  operations,  if  the  conclusion  is  variable  free,  its  negation 
need  not  be  included  among  the  assertions  since  extension  by  the  negation 
of  the  conclusion  in  any  deduction  from  the  conclusion  would  result  in  a 
non-acceptable  chain  with  an  A-iiteral  followed  by  an  identical  B- literal 
or  an  A- literal  followed  by  a complementary  A-iiteral. 

For  problems  involving  the  equality  relation,  a postprocessing 
operation  which  rejects  a chain  only  if  it  has  an  A-literal  followed  by  a 
complementary  A-literal  yields  a complete  inference  system  when  the 
extension,  reduction,  p-extension  and  p-reduction  operations  are  used. 
Also  for  problems  involving  the  equality  relation,  a postprocessing 
operation  which  rejects  a chain  only  if  it  has  a A-literal  followed  by  a 
complementary  A-literal  or  a B-literal  followed  by  an  identical  A-literal 
yields  a complete  inference  system  provided  the  factorization  operation  is 
also  used.  This  corresponds  closely  to  the  chain  perm i ssab i I i ty  criteria 
for  model  elimination  with  paramodulation  of  [8].  We  believe  (but  have  no 
proof)  tnat  for  problems  involving  the  equality  relation,  the  UEAK-SAVE  and 
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UEAK-DELETE  postprocessing  operations  yield  a complete  inference  system 
uhen  the  extension,  reduction,  p-extension  and  p-reduction  operations  are 
used  and  the  STRONG-SAVE  and  STRONG-DELETE  postprocessing  operations  yield 
a complete  inference  system  when  the  factorization  operation  is  also  used. 
The  equality  reflexive  (x=x)  and  functionary  reflexive  axioms  (e.g., 
f (x.yUf  (x,y))  theoretically  are  required  in  any  cas<=. 

Lemm.2  format i on.  An  additional  inference  operation  is  used  to  create 
neu  assertions  during  contraction.  A new  assertion  (called  a lemma) 
consisting  of  the  disjunction  of  the  terminal  A- literal  being  removed  and 
all  preceding  A- literals  whose  scope  (see  below)  exceeds  the  number  of  A- 
tera I s between  them  and  the  terminal  A-literai  and  the  complements  of  all 
preceding  B- literals  wnose  scope  exceeds  the  number  of  A-literals  between 
them  and  the  terminal  A-literai  can  be  inferred. 

The  scope  of  each  literal  in  the  conclusion  is  0 and  the  scope  of  each 

literal  added  to  a chain  in  the  extension  and  p-extension  operations  is  0. 

In  the  factorization  and  reduction  operations  (and  also  in  ground 

factorization  and  reduction  performed  by  the  postprocessing  operation),  the 

scope  of  the  leftmost  involved  literal  is  set  to  the  maximum  of  its 

previous  scope  and  the  number  of  A-literals  between  it  and  the  rightmost 

involved  literal.  In  the  p-reduction  operation,  the  scope  of  a literal 

descended  from  an  involved  A- literal  is  set  to  the  number  of  following  A- 

'iterals  in  the  derived  chain.  Each  other  literal  has  the  same  scope  as 

Its  parent  literal  in  the  parent  chain,  After  each  contraction  operation, 

the  scope  of  each  literal  is  set  to  the  minimum  of  its  previous  scope  and 

the  number  of  A-litmals  following  it  in  the  chain,  i.e.,  no  literal  will 

be  allowed  to  have  a scope  which  exceeds  the  number  of  following  A- 
I i tera I s. 
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Lemma  formation  creates  assertions  from  solved  goals.  Removal  of  an 
A- literal  by  contraction  does  not  mean  that  the  goal  it  represents  has  been 
solved  globally,  but  only  that  it  has  been  solved  in  the  environment  of  the 
Cham  of  which  it  was  a part.  Joined  in  disjunction  with  the  A-literal  is 
the  negation  of  each  of  the  assumptions  from  the  chain  used  in  the  solution 
of  the  A-iiteral.  Thus,  the  resulting  lemma  states  either  the  "solved"  A- 
literal  is  true  or  one  or  more  of  the  assumptions  was  false.  The 
assumptions  which  could  be  used  in  the  solution  of  the  A-literal  are  the 
negation  of  an  A-literal  (reduction,  o-reduction)  or  a B-literal 
(factorization).  The  scope  mechanism  keeps  track  of  the  assumptions  made 
with  respect  to  the  solution  of  each  goal. 

Lemma  formation,  while  it  may  generate  useful  assertions  during  the 
search  for  a proof,  is  not  required  for  completeness. 

Several  heuristics  are  available  to  eliminate  the  generation  of 
redundant  lemmas  such  as:  (1)  the  first  lemma  to  be  generated  after  an 
extension  operation  followed  by  zero  or  more  factorization  or  reduction 
operations  is  always  redundant  and  (2)  if  the  chain  has  two  or  more 
terminal  A-literals  and  the  lemma  associated  with  one  subsumes  the  lemma 
associated  with  another,  the  second  need  not  be  generated  (specific  cases 
of  this  condition  can  readily  be  checked  by  examining  the  scopes  of  the 
I i tera I s i nvo I ved) . 

gybSMmpt jgn.  Redundant  chains  and  assertions  can  be  eliminated  from 
future  use  by  subsumption.  One  chain  is  subsumed  by  another  and  can  thus 
be  eliminated  if  an  instance  of  the  latter  chain  is  an  initial  subsequence 
of  the  former  (sequences  of  B- literals  between  A-literals  may  be  freely 
reordered  during  the  subsumption  test).  One  assertion  subsumes  another  if 
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the  chain  corresponding  to  the  former  assertion  subsumes  the  chain 
corresponding  to  the  latter  (the  corresponding  chain  is  formed  by  making  a 
list  of  B-literals  being  the  consequents  and  complemented  antecedents  of 
implications  or  disjuncts  of  disjunctions)  and,  in  the  case  where  the 
subsuming  assertion  is  an  implication,  no  disjunct  or  consequent  of  the 
subsumed  assertion  is  matched  to  an  antecedent  of  the  subsuming  assertion. 

The  latter  provision  prevents  the  subsumption  of  an  assertion  by  another 
implicative  form  of  tne  same  assertion  both  of  which  may  be  required  for 
completeness.  Stronger  subsumption  rules  are  possible  (see  (8)). 

Uith  relation  to  a search  strategy,  two  additional  classes  of 
subsumption  are  recognized:  (1)  forward  subsumption  is  the  subsumption  of  a 
newly  created  chain  or  assertion  by  a previously  available  chain  or 
assertion  and  (2)  backward  subsumpt  i on  is  the  subsumption  of  a previously 
available  chain  or  assertion  by  a newly  generated  chain  or  assertion.  In 
general,  completeness  is  assured  only  if,  when  backward  subsumption  is 
used,  it  is  first  chtcked  whether  the  subsuming  chain  is  el i m i nab  I e by 
forward  subsumption. 

Examp  I es.  This  and  the  following  proofs  illustrate  the  usage  of  most 
forms  of  the  inference  operations.  Chains  are  represented  as  linear 
strings  of  literals  with  A- 1 i ter a I s bracketed.  A-literals  -epresent 
opened'  goals,  i.e.,  goals  for  which  a solution  is  currently  being 
attempted  in  the  chain.  B-literals  represent  "unopened"  goals,  i.e., 
subgoals  for  which  an  attempt  for  solution  has  not  yet  started.  Each  A- 
literal  is  a logical  consequence  of  all  the  literals  to  its  right;  thus, 
the  solution  of  each  B—  I i t era  I to  the  right  of  an  A-literal  solves  the  A- 
literal  while  also  solving  all  the  other  A-literals  to  the  right  of  the 
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solved  A-literal.  Deductions  are  represented  as  a vertical  sequences  of 
chains,  the  ancestor  of  each  derived  chain  being  the  chain  above  it.  Each 
derived  chain  is  annotated  to  describe  its  derivation  from  its  ancestor. 
If  a chain  is  the  result  of  extension  or  p-extension  by  an  assertion  with 
more  than  one  consequent  or  disjunct,  an  alphabetic  index  is  used  to 
designate  which  consequent  or  disjunct  was  used.  Indices  are  "a",  "b", 
'c',  etc.,  reading  from  right  to  left.  The  unannotated  chains  at  the  top 
of  each  sequence  of  chains  are  the  axioms.  Here  and  elsewhere  in  this 
paper,  the  conclusion  and  assertions  of  a problem  will  frequently  be 
referred  to  as  theorem  and  axioms  respectively. 

This  is  a proof  that  -(Pa  *»  Pb)  -♦  a*b. 


1.  Pa  v Pb 

2.  -Pa  v -Pb 

3.  a*b 


first  axiom  from  -IPa  «*  Pb) 
second  axiom  from  -(Pa  «»  Pb) 
theorem  to  be  established 


A.  [a*b]  -Pb  -Fb  p-extend  to  lb 

This  operation  initiates  a proof  by  contradiction.  Assuming  a=b 
(the  complement  of  the  literal  a*b) , the  truth  of  -Pb  a -Pb 
contradicts  Pa  v Pb. 


5.  (axb)  -Pb  factor 

It  is  only  necessary  to  prove  -Pb  once. 

G.  (axb)  I-PbJ  Pa  extend  by  2a 

By  axiom  2,  i f Pa  is  true  then  -Po  is  true. 


7.  iaxbl  1 — Pb]  [Pal  Pb  p-reduce  from  A-literal 

Again  assuming  a=b  to  derive  a proof  by  contradiction,  if  Pb  is 
true  then  Pa  is  true. 

8.  empty  reduce 

In  chain  7,  subject  to  the  assumption  that  the  conclusion  a*b  is 
false  and  that  a=b,  there  are  the  implications  Pb  -*  Pa  and  Pa  -♦ 
-Pb.  This  leads  to  Pb  -*  -Pb,  a contradiction  if  Pb  is  true. 
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Therefore,  -’Pb  must  be  assumed  to  be  true.  From  chain  S,  --Pb 
a*b,  so  the  theorem  is  proved. 


The  following  four  examples  are  adapted  with  some  modification  from 
193  . 


theorem 
extend  by  1 
extend  by  2 
extend  by  3a 
reduce 
factor 
extend  by  4 


theorem 

p-extend  from  1 
extend  by  2 
extend  by  3 
extend  by  4 


1.  a>0  -*  a>0 

2.  ->b>0 

3.  a>0 

4.  a*b 

5.  ta*b]  a>0  -b>0 
B.  Ea*b]  a>0 
7.  empty 

1 . a+b=2vfc  -»  axb  v a=c 

2.  b*c 

3.  a+b=2-.vc 

4.  a*b 

5.  Ea*b]  a+b=2*c  axe 

6.  ta*b]  a+b«=2*c  Ea*c]  b*c 

7.  Ea*b]  a+b=2*c 

8.  empty 


theorem 
extend  by  lb 
p-reduce  from  A-literal 
extend  by  2 
extend  by  3 


theorem 
p-extend  to  1 
extend  by  2 
extend  by  3 


1.  B -*  C 


2. 

A a -*0  -* 

B 

3. 

A -»  C v - 

0 

4. 

A 

5. 

C 

6. 

EC3  B 

7. 

IC1  [B3  A 

-0 

8. 

EC]  CB]  A 

E-DJ  A C 

9. 

EC]  EBJ  A 

E-0]  A 

10. 

EC]  IB]  A 

11. 

empty 

1. 

b<a  a a<b 

-*  a=b 

2. 

b>0 

3. 

a<b 

4. 

b<a 

5. 

aiZ 

G. 

Ea>0]  b<a 

a<b  b>0 

7. 

Ea>0]  b<a 

a<b 

8. 

Ea>0]  b<a 

9. 

empty 
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2.  The  Programmable  Strategy  Theorem  Prrver 


The  Progranimab  I e Strategy  Theorem  Prover  (PSTP)  is  a program  written 
in  UC!  LISP  13]  for  the  DECsystem-10  computer  implementing  the  linear  MESON 
procedure  as  described  in  Section  1. 

The  linear  MESON  procedure  is  a good  inference  system  for  an 
interactive,  programmable  strategy  theorem  prover  since,  it  being  an 
extension  of  the  prob  I em-reduc  t i on  method,  it  is  more  human-oriented  than 
alternative  systems,  and  ' ts  input  procedure  nature  and  the  relatively 
small  number  of  operations  that  can  be  performed  on  any  chain  facilitates 
the  design  and  use  of  the  programmable  search  strategy  capability. 

In  addition,  the  l;near  MESON  procedure  is  a suitable  choice  in  terms 
of  performance  since  it  appears  to  perform  competitively  with  other 
inference  procedures  when  used  as  the  inference  system  in  a fully  automatic 
system.  An  implementation  of  the  parent  model  elimination  procedure  at  New 
York  University  [4]  using  a depth  first  search  strategy  performed 
compe  tet  i ve  I y with  a theorem  prover  employing  the  set  ot  support  refinement 
and  unit  preference  search  strategy.  Further  evidence  of  the 
competitiveness  of  linear  MESON  procedure  based  systems  will  be  presented 
i n Sect i on  3. 

C.haJ ri  pr oper ties.  In  the  design  of  a theorem  proving  program,  it  is 
necessary  to  allow  fur  the  computation  and  retention  of  certain  information 
about  each  chain  (clause)  generated  during  the  search  for  a proof.  An 
example  is  the  necessity  of  retaining  information  on  parentage  of  each 
chain  so  a proof  can  be  traced  when  discovered.  Another  example  is  the 
computation  of  the  length  of  a chain  or  the  maximum  level  of  function 
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nesting  in  the  chain  if  length  or  depth  bounds  are  being  employed.  It 
uou  i d be  wasteful  to  always  compute  and  store  such  information  since  it  may 
not  always  be  needed.  Also,  the  retention  of  computed  information  about 
chains  should  be  contingent  on  such  variables  as  the  computational  effort 
required  to  compute  the  information,  frequency  of  use  of  the  information, 
and  cost  in  memory  of  storing  the  information.  (A  more  fundamental 
objection  to  always  storing  computed  information  about  a chain  is  that  the 
information  might  change  with  time.  For  example,  the  fact  that  a 
particular  chain  is  the  shortest  chain  in  memory  will  probably  be  falsified 
in  the  future.)  Another  important  consideration  in  the  design  uf  an 
information  storage  and  retrieval  mechanism  for  chains  is  the  ability  to 
define  new  data  which  can  be  optionally  computed  for  any  chain. 

This  iast  consideration  is  especially  important  in  an  interactive 
theorem  proving  program  so  that  the  user  can  cause  to  be  computed  whatever 
information  about  a chain  will  be  useful  to  him.  It  is  also  an  important 
consideration  in  the  design  of  a theorem  proving  program  which  allows  user 
specification  of  search  strategies. 

The  property  storage  and  retrieval  mechanism  for  chains  in  PSTP  was 
designed  to  possess  the  following  characteristics.  LJith  the  obvious 

j 

exceptions  of  the  number  of  a chain  and  its  ancestry  information,  no 
information  about  a chain  is  computed  unless  and  until  this  information  is 
requested.  Retaining  the  information  is  a user  option.  A new  computable 
datum  about  a chain  can  be  defined  be  merely  defining  the  LISP  function 
which  computes  the  information. 

The  mechanism  used  is  based  on  the  concept  of  a chain  property  list. 

This  is  a list  of  dotted  pairs;  the  first  component  of  each  dotted  pair  is 
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a property  name  (the  access  name  of  a datum  about  the  chain);  the  second 
component  is  the  value  of  the  property.  The  information  storage  and 
retrieval  mechanism  functions  in  the  following  way.  If  the  value  of  the 
ps  operty  named,  for  example,  NUT  (this  represents  in  PSTP  the  number  of 
literals  in  the  chain)  is  interrogated  for  a chain,  that  chain’s  property 
list  is  examined  for  a dotted  pair  with  'ust  component  NUT.  If  such  a 
dotted  pair  is  found,  its  second  component  is  the  desired  information.  If 
property  NUT  does  not  appear  in  the  chain  property  list,  the  LISP  function 
NLiT  is  evaluated  with  the  chain  (including  chain  property  list)  as  its 
single  argument.  The  value  the  function  NLIT  returns  is  then  the  desired 
information.  Further,  if  ine  LISP  atom  NLIT  has  non-NIL  value,  the 
property  name  NLIT  and  newly  computed  value  will  be  added  to  the  chain 
property  list. 

New  properties  are  defined  by  the  DP  ("define  property")  function. 
The  DP  function  takes  as  arguments  a function  name,  lambda  variable  list, 
and  expression  (just  like  the  UCI  LISP  DE,  DF,  and  DN  functions).  It 
creates  a LISP  function  which  performs  all  the  chain  property  list  lookup 
and  modification  operations,  and  evaluates  expression  for  the  argument  if 
the  property  value  is  not  found  on  the  chain  prrperty  list.  For  example, 
NLIT  is  defined  in  PSTP  by  evaluating  (DP  NLIT  (CHAIN)  (LENGTH  (CDDR 
CHAIN)))  where  LENGTH  is  the  LISP  function  which  computes  the  length  of  a 
list  and  (CDDR  CHAIN)  is  the  location  of  the  list  of  literals  of  chain 
CHAIN.  (SETQ  NLIT  T)  is  then  evaluated  to  order  retention  of  values 
computed  by  the  NLIT  function. 

Some  of  the  property  functions  already  defined  in  PSTP  compute  the 
number  of  A-literals  in  a chain  (NALIT),  the  number  of  B-literals  in  a 
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chain  (NBLIT),  the  total  number  of  literals  in  a chain  (NUT),  the  maximum 
function  depth  in  a chain  (DEPTH),  the  number  of  variables  in  a ciain 
(NVAR),  the  number  of  LISP  CONS  operations  required  to  construct  a chain 
(SIZE)  (this  is  a good  size  or  complexity  function),  and  the  level  (number 
of  inference  operations  in  the  derivation)  of  a chain  (NEXPAND) . 

£tl3-i.n  LLl-tgrs.  This  property  storage  and  retrieval  mechanism  supports 
a higher  level  chain  storage  and  retrieval  mec.ianism.  Filters  provide  a 
way  of  flexibly  specifying  which  chains  are  to  be  operated  upon  and  which 
derived  chains  are  to  be  stored.  Two  types  of  filters  are  distinguished  by 
usage,  input  filters  and  output  filters.  I npu t filters  a~e  employed  by  the 
user  to  specify  which  chains  are  to  be  operated  upon.  Only  chains 
selected  by  an  input  chain  filter  will  be  processed.  Output  filters  are 
used  to  specify  which  derived  chains  are  to  be  retained.  A chain  must  be 
accepted  by"  an  output  filter  to  be  stored.  The  general  form  for  a filter 
is  a unary  LISP  function  name  or  lambda  expression  which  returns  a non-NIL 
value  if  the  chain  argument  is  selected  or  accepted,  NIL  otherwise. 
Several  abbreviated  forms  are  also  available:  (1)  an  integer  selects  or 
accepts  a chain  with  that  chain  number,  (2)  a list  of  integers  selects  or 
accepts  chains  with  chain  numbers  in  the  list,  (3)  a three  element  list 
(called  a triple)  consisting  of  a binary  function  name  and  2 integers  or 
property  names  selects  or  accepts  chains  for  which  the  value  of  the 
function  applied  to  the  integers  and  property  values  is  non-NIL,  and  (4)  a 
list  of  triples  which  selects  or  accepts  chains  for  which  each  triple  has 
non-NIL  value. 

This  chain  storage  ard  retrieval  mechanism  is  very  flexible.  The  user 
can  designate  chains  for  processing  directly  by  number  or  by  the  proper  tit: 
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they  possess  and  can  arbitrarily  specify  the  necessary  conditions  for  a 
newly  derived  chain  to  be  stored.  This  user  specification  of  output 
filters  is  a far  more  general  form  of  the  usual  specification  of  bounds  in 
theorem  proving  programs. 

Search  strateau  speci fi cat  ion.  One  of  the  most  important  features  of 
PSTP  is  its  capability  for  specifying  the  search  strategy  to  be  usee  in 
searching  for  a proof.  Several  theorem  proving  programs  (e.g.,  QA3.G  U13) 
permit  the  user  to  specify  a particular  combination  of  refinements  of 
resolution  (restrictions  on  pairs  of  clauses  to  be  used  as  input  to  the 
resolution  operation  (e.g.,  linear,  merging,  set  of  support,  model 
refinements)),  but  the  capability  for  ordering  inference  operations  given  a 
particular  refinement  uf  resolution  is  uncommon.  PSTP  is,  of  course, 
restricted  to  using  the  linear  MESON  procedure  with  variations  restricted 
to  different  postprocessing  operations,  but  it  does  have  a general 
capability  for  specifying  search  strategies. 

Before  describing  the  search  strategy  specification  capability  of 
PSTP,  it  is  instructive  to  consider  the  proof  strategy  employed  in  many 
other  theorem  proving  systems  in  which  search  strategy  is  fixed  with 
possibly  a few  parameters  which  the  user  can  specify  to  tailor  the  proof 
search  to  a particular  problem.  Thus,  the  search  strategy  may  be 
fundamentally  depth  first  or  perhaps  breadth  first  with  a parameter 
specifying  the  permitted  amount  of  look-ahead  using  unit  preference.  Much 
of  the  control  the  user  has  over  such  systems  is  the  specification  of  which 
chains  to  discard.  However,  even  this  decision  is  severely  constrained. 
Usually,  the  user  is  only  permitted  to  specify  the  values  of  a few 
parameters  such  as  the  maximum  length  or  function  depth  of  clauses  to  be 
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Ue  have  seen  that  output  filters  generalize  the  capability  of 


specifying  retention  of  chains.  Chain  order  functions  provide  the 


capability  of  specifying  the  order  of  expansion  of  the  search  space, 


Lhain  order  functions.  Associated  with  each  list  of  chains  is  the 


name  of  an  order  function.  (The  order  function  is  actually  a chain 


property  function  as  described  above.)  Whenever  a chain  is  stored  in  a 


chain  list,  it  is  inserted  according  to  the  numerical  value  of  the 


corr espond i nr  order  function  applied  to  the  chain.  The  chains  with  the 


smallest  values  of  the  order  function  are  stored  at  the  top  of  the  chain 


list  (in  case  of  ties,  the  more  recently  stored  chains  will  be  on  top  of 


he  chain  list).  This  maintenance  of  chain  lists  in  sorted  order  in 


combination  with  the  SEARCH  and  SEARCH2  commands  provides  a quite  General 


capability  for  specifying  search  strategies. 


Sear gh  commands.  The  SEARCH  command  is  one  of  the  fundamental 


functions  for  automatically  expanding  the  search  space.  The  normal  mode  of 


operation  is  for  the  SEARCH  command  to  remove  the  top  chain  from  a chain 


list,  derive  all  possible  immediate  successor  cnains  from  this  chain  (this 


is  known  as  expanding  the  chain),  and  store  those  successor  chains  selected 


by  an  output  filter  in  the  original  chain  list  acccrding  to  its  order 


function.  Thus  by  specifying  an  order  function  and  using  the  SEARCH 


command,  the  user  can  specify  in  what  order  chains  are  to  be  expanded  and 


thus  partially  control  the  search  strategy.  For  example,  if  the  default 


order  function  (which  merely  returns  0)  is  used,  the  search  strategy  is  a 


depth  first  strategy.  If  the  deduction  level  of  a chain  is  used  as  order 


function,  me  search  strategy  is  a pure  bi  e^Jth  first  strategy  (level 


saturation).  The  SEARCH  command  can  be  viewed  as  an  implementation  of 
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Nilsson  s A*  algorithm  [10]  for  graph  searching  as  applied  to  theorem 
proving.  Each  derived  chain  is  a node  in  the  graph  and  the  generation  of 
all  immediate  successors  to  a chain  by  extension,  factorization,  etc., 
represents  the  expansion  of  a node  in  the  A*  algorithm. 

Although  the  SEARCH  command  is  very  effective  in  ordering  the 
expansion  of  chains,  the  full  expansion  of  a chain  at  each  step  often 
results  in  generating  a large  number  of  chains  that  will  not  be  used 
because  the  value  of  the  order  function  for  these  chains  exceeds  the 
maximum  order  function  value  of  any  chain  appearing  in  some  proof.  This 
presents  two  difficulties:  (1)  unused  high  order  function  valued  chains 

UP  memory  too  quickly  and  (2!  their  generation  requires  extra, 
unnecessary  work.  The  first  of  these  problems  could  be  solved  by 
specifying  an  output  filter  that  rejects  chains  with  order  function  value 
exceeding  a certain  amount.  However,  this  solution  generates  a bounded 
search  strategy,  i.e.,  a parameterized  incomplete  strategy  ulvch  may  fail 
to  find  a proof  because  the  order  function  maximum  is  set  too  low. 
Moreover,  the  specification  of  a bounded  search  strategy  fails  to  solve  the 
problem  of  extra  work  required  in  the  generation  of  rejected  chains. 

The  solution  adopted  for  PSTP  includes  a means  for  specification  of 
ordering  of  individual  inference  operations  rather  than  just  chains.  The 
form  of  the  value  of  the  order  function  was  generalized  to  include  sorted 
lists  of  operations  with  numerical  values.  For  example,  the  order  function 
value  ((201  REDUCE)  (302  EXTEND  S))  could  represents  the  order  function 
value  for  a chain  with  previously  unperformed  operations  of  reduction  and 
extension  by  chain  number  6.  Chains  with  order  function  values  of  this 
form  are  inserted  into  chain  lists  according  to  the  numerical  value  of  the 
first  specified  operation  (201  in  the  example). 
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The  SEARCH?  command  is  designed  to  operate  on  chain  lists  with  order 
functions  of  the  new  form.  The  SEARCH2  command,  rather  than  deleting  and 
expanding  the  top  chain  on  the  chain  list,  deletes  and  performs  omy  the 
first  inference  operation  of  the  order  function  va'ue  o#  the  top  chain  on 
the  cham  list.  If  any  inference  operations  remain  in  the  order  function, 
the  top  chain  is  reinserted  in  the  chain  list  according  to  the  value  of  the 
next  specified  inference  operation.  Thus,  the  chain  list  is  always  a list 
of  chains  ordered  according  to  the  minimum  value  of  the  unperfo-awd 
inference  operations  for  that  chain.  Tne  SEARCH2  command  will  generate 
successors  of  a single  chain  uninterruptedly  only  so  long  as  none  of  iha 
successors  of  the  cham  or  any  other  chain  on  the  chain  list  has  an 
inference  operation  with  lower  numerical  value  than  the  next  inference 
operation  to  be  performed  on  the  current  chain. 

The  SEARCH2  command  can  also  be  viewed  as  an  implementation  of 
Nilsson’s  A*  algorithm  for  graph  searching  w.th  each  node  being  a chain  and 
d single  inference  operation.  Expansion  of  a node  now  merely  consists  of 
applying  that  inference  operation  to  that  chain. 

Using  the  SEARCH2  command,  for  example,  it  is  possible  to  use  an  order 
function  which  specifies  a depth  first  search  strategy  that  performs  only 
one  inference  operation  on  each  level  (until  a level  bound  is  reached 
forcing  a backup).  A more  realistic  order  function  for  use  with  tne 
bEARCH2  command  was  used  in  the  experiment  described  Section  3.  Note  that 
the  performance  of  the  single  specified  inference  operation  may  actually 
result  in  the  generation  of  more  than  one  successor  chain  as,  for  example 
when  there  are  two  ways  of  extending  a chain  by  a particular  axiom. 

Eflnmai  functions.  An  additional  mechanism  for  alteriny  the  strategy 
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used  by  PSTP  is  accomplished  by  the  use  of  format  functions.  Format 
funct.ons  can  be  used  to  reformat  (edit)  chains  prior  to  their  storage. 
For  example,  a format  function  can  be  used  to  reorder  the  last  B-literals 
of  a chain  to  accomplish  the  effect  of  Kowalski  and  Kuehner’s  literal 
selection  function  mechanism  [5].  A format  function  is  associated  with 
each  chain  list.  Each  chain  is  reformatted  according  to  the  order  function 
of  the  chain  list  into  which  it  is  being  stored  unless  it  is  already  in 
that  format.  A chain  may  be  stored  in  two  different  formats  in  two 
different  chain  lists  (formats  are  permitted  to  alter  the  sequence  of 
literals  which  constitutes  the  chain  itself,  but  not  the  chain  property 
list).  Default  format  functions  are  those  which  convert  chains  to  x- 
standardized  or  y-standardi zed  form  by  renaming  variables. 

Comn  and  summary.  Following  is  a br  iet  description  of  each  of  most  of 
the  PSTP  commands.  These  PSTP  commands  can  be  divided  into  four  classes: 
dec ' arat  i c commands  (CHAINLIST,  PARAMETERS,  POSTPROCESSING,  PROBLEM) , 
informative  commands  (ANCESTRY,  COUNT,  DISPLAY),  manipulative  commands 
(COPY,  DELETE,  FOR,  TRANSFER),  and  inference  commands  (EXPAND,  SEARCH, 
SEARCH2) . An  abbreviated  syntax  for  each  command  is  also  presented; 
linguistic  variaoles  are  enclosed  in  angle  brackets  (e.g.,  "<sources>" ) and 
optional  command  phrases  are  enclosed  in  square  brackets  (e.g.,  "[DELETE!" 

and  "[TO  <dest  i nat  i ons>] " ) . If  any  phrase  of  a command  is  absent,  a 
default  value  will  be  used. 

In  the  descriptions  of  the  commands,  the  most  important  linguistic 
variables  are  <sources>  and  <destinations>.  A source  or  destination 
represents,  in  general,  a chain  list  and  a chain  filter.  For  a chain  to  be 
used  by  a command  specifying  <sources>,  it  must  be  a member  of  one  of  the 
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specified  chain  lists  and  be  selected  by  the  corresponding  chain  filter. 
For  a chain  to  be  stored  in  the  chain  list  of  a destination  by  a command 
specifying  <dest  i nat  i ons>,  it  must  be  accepted  by  the  corresponding  chain 
filter.  Note  that  in  the  syntax,  <sources>  and  <dest inat ions>  are  never 

optional  (except  as  part  of  an  optional  phrase).  This  is  because  the  emp^y 

specification  for  <scurces>  and  <destinations>  is  legal  and  has  a default 
va I ue. 

Any  of  the  non-declarative  commands  can  be  interrupted  at  any  time  by 
typing  any  character.  The  command  completes  processing  of  the  current 
chain  and  then  enters  a "break".  In  this  state,  the  user  can  execute  any 
PSTP  command  or  LISP  function  and  continue  or  aoort  the  processing  of  the 
interrupted  command. 

ANCESTRY  command.  The  (ANCESTRY  [DELETE!  <sources>)  command  prints 
the  derivation  of  each  chain  designated  by  <sources>.  If  DELETE  is 
specified,  each  designated  chain  is  also  deleted  from  its  chain  list. 

CHAINLIST  command.  The  (CHAINLIST  (<dec I arat i ons>]  ) command  is  used 
to  declare  cham  lists  that  mil  be  used  and  their  format  and  order 
functions.  If  <dec I arat i ons>  is  absent,  the  CHAINLIST  command  prints  the 
list  of  previously  declared  chain  lists  and  their  format  and  order 
funct ions. 

CQPV  command.  The  (COPY  (DELETE)  <sources>  (TO  <dest i nat i ons>) ) 
command  copies  each  chain  designated  by  <sources>  to  each  of 
<dest inat ions>.  If  DELETE  is  specified,  each  designated  chain  is  also 
deleted  from  its  chain  list. 

CflUtJI  EM-  The  (COUNT  (DELETE)  <sources>)  command  counts  the 
number  of  chains  designated  by  <sources>.  If  DELETE  is  specified,  each 
designated  chain  is  also  deleted  from  its  chain  list. 
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DELETE  ggninigrid.  The  (DELETE  <sources>)  is  the  same  as  the  (COUNT 

<sources>)  command,  i.e.,  the  CDUNT  command  with  chain  deletion 
spec i f i ed. 

DISPLAY  command.  The  (DISPLAY  (DELETE]  <sources>)  command  prints  each 
chain  designated  by  <sources>.  if  DELETE  is  specified,  each  designated 
chain  is  also  deleted  from  its  chain  list. 

EXPAND  opnimand.  The  (EXPAND  (EXTEND]  [FACTDR]  (REDUCE]  (PEXTEND] 

(PREOUCE]  (DELETE]  <sourcesl>  (BY  <sources2>]  (GIVING  <dest inat i onsl>]  (AND 
<dest inat ions2>) ) is  the  principal  inference  command  for  interactive  use. 
It  will  perform  the  designated  inference  operations  on  each  of  the  chains 
designated  by  <sourcesl>  using  each  chain  designated  by  <sources2>  as 
second  argument  to  binary  inference  operations  (extension  and  p-extens i on) . 
Derived  chains  will  be  stored  in  <dest i nat i onsl>  and  lemmas  will  be  stored 
in  <dest  i nat  i ons2>  (chain  filters  permitting).  If  DF'.ETE  is  specified, 
each  designated  chain  in  <sourcesl>  is  also  deleted  from  its  chain  list. 
The  EXPAND  command  is  restricted  to  performing  inference  operations  on 
chains  existing  at  the  time  of  its  invocation,  i.e.,  it  will  not  perform 
any  inference  operations  on  chains  it  has  just  derived.  The  command 
terminates  when  (1)  the  empty  chain  is  generated,  i.e.,  a proof  has  been 
found,  (2)  all  the  specified  operations  have  been  performed,  or  (3)  the 
user  suspends  processing  of  the  command  by  typing  any  character.  If  no 
inference  operations  are  designated,  all  inference  operations  will  be  used. 

If  at  least  one  inference  operation  is  designated,  the  word  EXPAND  may  be 
omi tted. 

FOR  command.  The  (FDR  (DELETE]  <sources>  DO  <function>)  command 
applies  the  unary  LISP  function  <function>  to  each  chain  designated  by 
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<sources>.  If  DELETE  is  specified,  each  designated  chain  is  also  deieted 
from  i ts  cha in  list. 

PARAQET£RS  command.  The  (PARAMETERS  [ < i ndex>  ] ) command  is  used  to 
deciare  the  values  of  several  global  parameters.  If  an  <index>  (an 
arbitrary  LISP  atom)  is  specified,  it  designates  fcr  use  the  predefined  set 
of  parameter  values  associated  with  <index>.  If  an  <mdex>  is  not 
specified  or  <index>  has  no  previously  defined  meaning,  the  PARAMETERS 
command  asks  a series  of  questions  requiring  the  user  to  define  the  value 
for  each  parameter.  Parameters  set  by  the  PARAMETERS  command  include: 
whether  newly  generated  chains  are  to  be  printed,  the  format  in  which 
chains  are  to  be  printed,  whether  lemmas  are  to  be  generated,  and  whether 
subsumption  is  to  be  performed. 

E.QSTPR0CE55I  NG  command.  The  (POSTPROCESSING  [<index>])  command  is 
used  tc  declare  wr.at  postprocessing  operation  is  to  be  employed.  If  an 
< i ndex>  (an  arbitrary  LISP  atom)  is  specified,  this  <index>  designates  the 
postprocessing  operation  that  will  be  used.  Allowed  <index>s  include  UEAK- 
SAVE,  UEAK-OELETE,  STRONG-SAVE,  and  STRONG-DELETE,  designating  the 
postprocessing  operations  described  in  Section  1.  ’f  an  <index>  is  not 
specified  or  <index>  has  no  previously  defined  meaning,  the  POSTPROCESSING 
command  asks  a series  of  questions  requiring  the  user  to  designate  which 
action  among  a list  of  alternative  actions  is  to  be  taken  for  a given 
condition.  For  example,  the  POSTPROCESSING  command  may  ask  whether,  in  the 
case  of  an  A- 1 i ter  a I followed  by  an  identical  B-literal,  the  B-literal 
should  be  saved,  deleted,  or  deleted  with  the  reduction  operation  recorded 


prob/em  for  the  theorem  prover.  It  first  makes  the  chainlist  declarations 
specified  by  <dec I ara t i ons>  (if  fewer  than  two  chain  list  declarations  are 
specified,  up  to  two  default  declarations  will  be  made)  and  then  asks  the 
user  to  type  in  the  theorem  and  each  axiom.  The  theorem  is  stored  in  the 
first  declared  chain  list;  its  negation  and  the  axioms  are  sto-d  in  the 
second  dec  I area  chain  list.  The  input  format  for  the  theorem  and  axioms  is 
tne  same  as  was  used  in  the  description  of  the  linear  MESON  procedure 
except  that  prefix  form  for  preo.cate  and  function  symbols  is  required,  and 
(due  to  character  set  limitations)  a and  v are  omitted  and  ->  is 

substituted  for  Thus,  Pab  a Pba  -♦  a=b  v Qabx  is  typed  in  as  Pab  Pba  --> 

“ab  Qabx*  The  theorem  and  axioms  are  then  encoded  into  internal  list  form. 
The  PROBLEd  command  permits  the  user  to  save  the  encoded  axioms  so  that 
they  will  not  need  to  be  retyped  in  future  proofs  of  the  same  problem. 

SEARCH  ££0imand.  The  (SEARCH  [EXTENDI  [FACTOR]  [REDUCE!  [PEXTENDI 
[PREDUCEI  csourcesl > [BY  <sources2>]  [GIVING  <dest , nat i onsl>]  [AND 
<dest i nat i ons2>] ) repeatedly  deletes  the  first  chain  from  <sourcesl>  (a 
cha.n  with  lowest  order  function  value)  and  performs  on  it  each  designated 
inference  operation  with  the  chains  designated  by  <sources2>  as  second 

argument  to  binary  inference  operations  (extension  and  p-extens i on) . 
Derived  chains  will  be  stored  in  <dest  i nat i onsl>  and  lemmas  will  be  stored 
in  <dest inat ions2>  (chain  filters  permitting).  So  that  newly  generated 
chains  cun  be  used  as  input  to  inference  operations  by  the  SEARCH  command, 
<sourcesl>  and  <des t i na t i onsl>  will  ordinarily  specify  the  same  chain 

lists.  The  command  terminates  when  (1)  the  empty  chain  is  generated,  i.e., 
a proof  has  been  found,  (2)  <sourcesl>  is  empty  meaning  no  more  operations 
can  be  performed  and  no  proof  could  be  found  within  the  constraints  of  the 
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specified  operations,  initial  chains,  and  chain  filters,  or  (3)  the  ut  ar 
suspends  processing  of  the  command  by  typing  any  character.  If  no 
inference  operations  are  designated,  all  inference  operations  ui  I I be  used. 

SEARCH2  command.  The  (SEARCH2  [EXTENDI  [FACTOR]  [REDUCE!  [PEXTENDI 
[PREDUCEI  <sourcesl>  [BY  <sources2>I  [GIVING  <destinationsl>I  [mND 
<dest i nat i ons2>I ) repeatedly  deletes  the  first  chain  from  <sourcesl>  (a 
cham  with  lowest  order  function  value)  and  performs  on  it  the  first 
designated  inference  operation  in  the  order  function  value.  This  inference 
operation  is  then  deleted  from  the  order  function  value  and,  if  any 
inference  operations  remain  in  the  order  function  value,  the  chain  is 
reinserted  in  <sourcesl>  (now  with  the  numerical  value  associated  with  the 
next  inference  operation  as  the  numerical  value  of  the  chain  for  insertion 
into  the  sorted  chain  list).  Derived  chains  will  be  stored  in 
<dest i nat i onsl>  and  lemmas  will  be  stored  in  <dest i nat i ons2>  (chain  filters 
permitting).  So  thvit  newly  generated  chains  can  be  used  as  input  to 
inference  operations  by  the  SEARCH2  command,  <sourcesl>  and  <dest i nat i onsl > 
will  ordinarily  specify  the  same  chain  lists.  The  order  function,  using 
variables  of  the  SEARCh2  function,  will  construct  a list  of  inference 
operations  with  (in  the  case  of  binary  inference  operations)  second 
arguments  as  specified  in  <sources2>  for  derived  chains  as  they  are  stored. 
The  command  terminates  when  (1)  the  empty  chain  is  generated,  i.e.,  a proof 
has  been  found,  (2)  <sourcesl>  is  empty  meaning  no  more  operations  can  be 
performed  and  no  proof  could  be  found  within  the  constraints  of  the 
specified  operations,  initial  chains,  and  chain  filters,  or  (3)  the  user 
suspends  processing  of  the  command  by  typing  any  character.  If  no 
inference  operations  are  designated,  all  inference  operations  will  be  used. 
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TRA£i£F£R  command.  The  (TRANSFER  <sources>  [TO  <dest i nat i ons>] ) 
command  is  the  same  as  the  (COPY  DELETE  <sources>  [TO  <dest  i nat  i ons>]  ) 
command,  i.e.,  the  COPY  command  with  chain  deletion  specified. 
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3.  Performance  Study 


In  order  to  give  some  idea  of  the  performance  of  PSTP  with  some  simple 
search  strategies  and  to  make  some  points  about  relative  merits  of  some  of 
these  strategies,  the  results  of  PSTP  runs  cm  3 examples  using  4 strategies 
are  presented  here.  Results  are  compared  to  results  for  two  other  theorem 
proving  programs  tested  on  the  same  examples. 

Ih£  example?.  The  examples  are  taken  from  a comparative  study  of 
theorem  proving  strategies  used  by  QA3.G  by  Reboh  et  al  111]  (additional 
information  on  sources,  theory,  and  previous  uses  of  these  examples  are  in 
till);  the  same  examples  were  also  run  for  an  SL-reso  I ut  i on  theorem  prover 
(here  called  SLRTP)  by  Aubin  tl,2I.  The  examples  are  axiomatized  just  as 
for  QA3.G  with  an  occasional  suostitution  of  a disjunction  for  an 
no  I i cat  i on  and,  in  the  cases  of  unsat  i sf  i ab  I e sets  of  axioms,  the  use  of 
the  negation  of  one  of  the  axioms  as  the  theorem. 

inference  gperat  ions  used.  All  the  examples  were  run  with  extension 
as  the  only  rule  of  inference  except  the  NUfll  example  for  which  reduction 
was  also  necessary.  The  UEAK-DELETE  postprocessing  operation  was  used  for 
all  the  examples.  Its  use,  of  course,  permits  ground  factorization  and 
reduction.  In  some  examples  (BURSTALL,  SHORTBURST,  GR0UP1,  GR0UP2) , it  is 
readily  apparent  from  the  structure  of  the  problem  that  no  reduction  is 
possible  (since  every  chain  derived  from  the  theorem  has  only  positive 
literals  eliminating  any  possibility  of  matching  an  A-literal  with  a 
complementary  B- literal).  The  ANCES1  example  is  propositional  and  thus  the 
ground  reduction  in  the  UEAK-3ELETE  postprocessing  operation  is  sufficient. 
In  the  remaining  three  problems  (HAS-PARTS1,  HAS-PARTS2,  PRIM)  for  which 
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reduction  was  not  employed  (although  ground  reduction  was  used  in  each), 
the  use  of  the  reduction  operation  resulted  in  the  generation  of  no 
additional  chains.  Lemmas  were  not  generated  for  any  of  the  examples. 

S§9r str ateci i es  used.  The  strategies  used  are  characterized  by  4 
parameters:  length  multiplier,  level  multiplier,  length  maximum,  and  level 
maximum.  The  length  of  a chain  is  defined  to  be  its  number  of  B-l  iterate. 
This  is  consistent  with  the  notion  of  the  length  of  a clause  in  resolution 
theorem  proving  being  its  number  of  literals  since  in  a chain  A- literals 
record  ancestry  information  and  would  not  be  present  in  the  corresponding 
clausa  form.  The  level  of  a chain  is  defined  to  be  the  number  of  inference 
operatic. a employed  in  deriving  it  from  the  alleged  theorem  excluding  those 
operations  (ground  factorization  and  reduction)  automatically  performed  by 
tne  postprocessing  operation. 

The  SEARCH2  search  command  was  employed  with  projected  inference 
operations  ordered  according  to  the  minimum  values  of  a weighted  sum  of  the 
expected  length  and  level  of  the  result.  The  expected  length  of  a chain 
derived  by  extension  is  the  length  of  its  parent  chain  being  extended  plus 
the  length  of  the  axiom  minus  2.  The  expected  length  of  a chain  derived  by 
factorization  or  reduction  is  the  length  of  its  parent  chain  minus  1.  The 
actual  length  may  be  less  (but  never  more)  due  to  removal  of  B-literals  by 
the  accepting  transformation.  The  expected  and  actual  level  of  a chain  is 
the  level  of  its  parent  plus  1.  Only  inference  operations  whose  results 
have  expected  lengths  and  levels  not  exceeding  the  length  or  level  maxima 
will  be  attempted  (this  way  of  implementing  length  and  level  maxima  was 
also  used  by  QA3.6  and  SLRTP). 

Two  sets  of  length  and  level  multipliers  were  tried.  The  first  has  a 


length  multiplier  of  101  and  a level  multiplier  of  100  and  is  called  the 
101/100  strategy.  In  the  101/100  strategy,  the  projected  inference 
operation  with  highest  merit  is  one  uith  the  smallest  value  of  (100  times) 
the  sum  of  expected  length  and  level  of  the  result.  Ties  are  resolved  in 
favor  of  lesser  expected  length  (a  100/101  strategy  would  resolve  ties  in 
favor  of  lesser  expected  level).  (It  is  assumed  here  that  the  expected 
length  of  a chain  uill  never  exceed  100.)  The  most  important  thing  to  note 
about  the  101/100  strategy  is  that  it  is  essentially  the  same  as  Kowalski 
and  Kuehner’s  upper  diagonal  search  strategy  (5).  1 1 i s an  admissable 
strategy  Q0)  except  for  cases  where  the  postprocessing  operation  removes 
B-literals  by  ground  factorization  or  reduction.  First  proofs  d'scoverec' 
by  admissable  strategies  are  guaranteed  to  be  minimum  level  proofs. 

The  second  strategy  has  a length  multiplier  of  501  and  a level 
multiplier  of  100  and  is  called  the  501/100  strategy.  In  the  501/103 
strategy,  the  projected  inference  operation  with  highest  merit  is  one  with 
the  smallest  value  of  (103  times)  the  sum  of  the  expected  level  and  5 times 
the  expected  length  of  the  result.  Ties  are  again  resolved  in  favor  of 
lesser  expected  length.  By  multiplying  length  by  5 times  as  much  as  level, 
a strong  length  preference  strategy  is  produced.  The  501/100  strategy  is, 
of  course,  inadmissable  since  it  is  clearly  not  always  the  case  that  it 
requires  at  least  5 inference  operations  to  remove  a single  literal.  (For 
a strategy  to  be  admissable,  the  estimated  additional  cost  to  solution  must 
always  be  less  than  or  equal  to  the  actual  additional  cost  to  solution.) 

The  101/100  and  501/100  strategies  were  each  tried  with  (bounded)  and 
without  (unbounded)  length  and  level  maxima.  The  length  and  level  maxima 
used  were  those  used  by  QA3.G  wherever  oossible. 
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Statist! cs. 


The  performance  of  strategies  will  be  primarily 
characterized  by  the  “chains  generated"  statistic.  Here,  this  information 
is  represented  by  a 4-tuple:  the  first  component  is  the  number  of  chains 
retained;  the  second  component  is  the  number  of  acceptable  chains 
generated;  the  third  component  is  the  total  number  of  chains  generated;  the 
fourth  component  is  the  number  of  attempted  inference  operations.  The 
number  of  retained  chains  is  the  number  of  acceptable  chains  minus  the 
number  of  chains  eliminated  by  subsumption,  function  depth  tests,  etc.  No 
such  processes  mere  used  to  eliminate  chains  in  this  experiment,  so  the 
number  of  retained  chains  is  always  equal  to  the  number  of  acceptable 
chains.  The  total  number  of  chains  generated  is  the  number  of  acceptable 

chans  plus  the  number  of  non-acceptab I e chains  generated.  These 
statistics  and  the  time  figures  referred  to  below  are  automatically 
accumulated  by  PSTP  and  printed  out  when  a proof  is  found. 

Nearly  comparable  statistics  are  presented  where  available  for  QA3.6 
and  SLRTP  (except  QA3.6  statistics  refer  to  clauses  rather  than  chains). 

Best  and  mean  performance  figures  are  presented  for  QA3.B  on  each 
example.  For  QA3.G,  the  number  of  retained  clauses  is  the  number  of 

retained  clauses  after  subsumption  and  function  depth  tests;  the  number  of 
acceptable  clauses  is  computed  as  the  number  of  successful  resolutions  and 
factorings;  the  number  of  attempted  inference  operations  is  computed  as  the 
number  of  attempted  resolutions  and  factorings.  The  proportion  of  tested 
QA3.B  strategies  discovered  a proof  is  given  on  the  same  line  as  the 

mean  performance  of  QA3.B  strategies;  unsuccessful  strategies  were  excluded 
in  computing  the  means. 

Performance  figures  are  presented  for  SLRTP  where  the  set  of  support 
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for  the  refutation  was  the  negation  of  the  theorem  in  the  PSTP  proof  on 


each  example.  Due  to  the  similarity  of  operations  and  terminology  between 
the  linear  MESON  procedure  and  the  inference  system  for  SLRTP,  SL- 
resolution  [5],  we  will  here  present  a brief  description  of  SL-reso I ut i on. 

SL-r  eso  I ut  i on,  a refutation  procedure,  can  be  viewed  as  a variant  of 
model  elimination  without  equality  with  the  following  features.  (1)  The 
capability  for  reordering  B-literals  at  the  end  of  a chain  is  formalized  in 
the  form  of  a literal  selection  function  which  designates  the  literal  to  be 
extended  on  in  succeeding  extension  operations.  (2)  Factorization  is  a 
required  operation  for  completeness  in  SL-reso I ut i on  since  the  equivalent 
of  the  STRONG-SAVE  postprocessing  operation  is  employed.  The  model 
elimination  factorization  and  reduction  operations  are  combined  into  the 
SL-reso I ut i on  reduction  operation.  (3)  SL-reso I ut i on  requires  a fully 
factored  input  set  of  clauses,  i.e.,  every  non-tautologous  factor  of  an 
input  axiom  must  also  be  input  (or,  as  in  SLRTP,  derived).  A benefit  of 
this  is  that  SL-reso  I ut  i on  reduction  operations  need  never  be  performed 
with  the  leftmost  involved  literal  being  or  following  the  last  A-literal  of 
the  chain.  (A)  Any  B-literal  following  the  last  A-literal  of  the  chain  is 
a candidate  for  removal  by  the  reduction  operation,  not  just  the  rightmost 
as  in  model  elimination,  (5)  Upper  diagonal  search  is  the  prescribed 
search  strategy  for  SL-resol ut  ion. 

For  SLRTP,  the  number  of  retained  chains  is  the  number  of  retained 
chains  after  function  depth  tjsts  and  subsumption  (subsumption  is  only  used 
in  eliminating  redundant  axiom  chains  or  their  factors  during  the  process 
of  generating  a fully  factored  input  set  of  axioms);  the  number  of 
acceptable  chains  is  computed  as  the  number  of  successful  extensions, 
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reductions,  and  factorings  (used  only  for  generating  a fully  factored  input 
set  of  axioms);  the  number  of  attempted  inference  operations  i s computed  as 
the  number  of  attempted  extensions,  reductions,  and  factorings.  The  GR0JP1 
and  GR0UP2  example  statistics  are  taken  from  123. 

Line  stat i st i cs.  The  "search  time"  statistic  represents  the  time 
spent  in  searching  for  a proof  by  a compiled  version  of  PSTP;  it  excluues 
time  s^ent  in  inputting  the  problem,  outputting  of  final  statistics  and 
proof,  and  garbage  collection,  although  it  does  inc'ude  time  reauired  for 
some  trace  output  during  the  search.  Search  time  is  the  only  widely 
variable  component  of  total  time  to  solution  with  problem  inpuf  and 
statistics  and  proof  output  time  relatively  constant  and  small.  Although 
PSTP  is  conservative  of  storage  (performing  LISP  CONS  operations  only  when 
necessary  when  instantiating  chains)  and  tnerefore  ordinarily  requires  few 
garbage  collections,  garbage  collection  time  is  excluded  because  (1)  time 
consuming  garbage  collections  occurring  at  random  times  in  the  search  for  a 
proof  tend  to  randomize  the  time  statistics  especially  for  short  searches 
(this  problem  could  be  overcome  by  always  starting  a search  for  a proof 
immediately  after  a garbage  collection)  and  (2)  frequency  of  garbage 
collection  is  dependent  on  the  amount  of  storage  available  (with  infinite 
storage,  there  need  not  be  any  garbage  collections).  Nearly  all  the  proofs 
presented  here  were  found  with  about  25000  words  available  for  storing 
chains  and  most  were  found  with  no  garbage  collections. 

Time  statistics  should  not  be  used  for  comparison  among  strategies 
used  by  different  theorem  provers  without  considerable  caution  and  more 
information  than  is  usually  available.  Such  statistics  are  of  course 
influenced  by  the  machine  and  operating  system  used,  language  and  coding  of 
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the  theorem  proven,  whether  compiled  (SLRTP,  PSTP)  or  interpreted  (QA3.G), 
sper i a I conditions  applying  to  the  operation  of  the  theorem  prover  (e.g., 
tracing),  and  some  randomness  in  the  times  themselves  (such  randomness, 
attributable  to  variable  load  on  the  time  sharing  system,  is  visible  in 
some  anomalies  in  the  statistics  presented  here). 

B-£-£  a i t $ . Four  primary  observations  can  be  made  from  the  results  of 
the  experiment  presented  here:  (1)  PSTP  performs  competitively  with  QA3.6 
and  SLRTP,  (2)  the  501/133  strategy  performs  better  than  the  101/100 
strategy  (for  these  examples),  (3)  the  501/103  strategy  is  relatively 
insensitive  to  length  and  level  bounds,  the  181/133  strategy  is  much  more 
sensitive,  and  (4)  elimination  of  some  implicative  forms  of  the  axioms  can 
result  in  improved  per f ormance. 

The  basis  tor  comparison  of  the  results  of  PSTP  and  GA3.6  is  the 
number  of  acceptable  chains  generated  (equals  the  number  of  chains 
retained)  for  PSTP  versus  tne  number  of  acceptable  clauses  generated 
(equals  the  number  of  successful  reso'utions  and  factorings)  for  QA3.6. 
This  is  a fairer  comparison  than  one  using  the  number  of  retained  clauses 
for  GA3.6  since  QAo.b  eliminated  clauses  by  function  depth  maxima  and 
subsumption.  Even  this  comparison  is  still  somewhat  unfair  to  PSTP  since 
if  function  depth  tests  and  subsumption  had  not  been  used  in  GA3.G,  the 
number  of  generated  clauses  would  presumably  have  been  larger  since 
eliminated  clauses  couiu  now  act  as  parent  clauses  ,n  additional 
i nf erences. 

Using  this  basis  for  comparison,  the  unbounded  581/138  strategy  (the 
strategy  we  prefer  for  reasons  given  below)  performed  better  than  the 
average  of  QA3.6  strategies  which  found  a proof  in  all  the  examples  except 
PRIM,  GR0UP1 , and  GRGUP2. 
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In  the  PRIM  example,  the  unbounded  501/100  strategy  performed  only 
s.ightly  worse  than  the  average  of  QA3.G  strategies. 

In  the  GR0UP1  example,  the  absolute  difference  in  performance  is  small 
even  if  the  number  of  chains  generated  by  PSTP  is  double  the  number  of 
clauses  generated  by  QA3.G.  In  view  of  the  fact,  for  example,  that  by 
reversing  the  order  of  presentation  of  the  axioms  to  PSTP  can  cause  the 
performance  of  PSTP  to  exceed  that  of  QA3.6,  we  tend  to  regard  this 
difference  as  being  relatively  insignificant. 

The  difference  in  the  case  of  GRQUP2  is  much  more  serious  and  has  a 
rather  different  explanation.  Where  the  formulation  of  the  GR0JP2  example 
has  several  unit  axioms  and  two  4-literal  associative  axioms,  the  use  of  a 
length  maximum  value  of  3 can  be  seen  to  be  extremely  restrictive.  In 
resolution  terms,  this  length  maximum  requires  that  only  units  be  resolved 
against  the  associative  axioms,  and  in  the  case  of  GR0UP2  if  the  negation 
of  the  theorem  19  used  as  the  set  of  support,  the  length  maximum 
automatically  restricts  any  tested  strategy  to  a further  refinement  of  unit 
resolution  in  which  only  the  negation  of  the  theorem  can  be  directly 
resolved  against  the  associative  axioms.  PSTP  was  tested  with  a variant  of 
the  GR0UP2  example  in  which  axiom  3 was  reordered  so  that  a proof  meeting 
the  length  maximum  value  of  3 restriction  existed.  Gn  this  example,  the 
unbounded  101/100  strategy  generated  435  chains;  all  the  other  strategies 
generated  23.  We  therefore  feel  that  the  better  performance  of  QA3.G  on 
this  problem  was  more  attributable  to  the  restrictive  length  maximum  than 
to  an  intrinsic  inferiority  of  PSTP. 

The  comments  about  the  restrictive  length  bound  used  by  QA3.G  in  the 
GR0UP2  example  can  be  extended  to  several  other  examples.  The  BURSTALL, 
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SHORTBURST,  HAS-PARTS1,  HAS-PARTS2,  PRIM,  ANCES1 , GROUP1 , and  GR0UP2 


examples  all  had  very  restrictive  length  maxima,  in  every  case  set  at  or 
below  the  minimum  value  required  for  PSTP  to  discover  a proof.  Level 
maxima  were  often  similarly  restrictive  although  we  perceive  this  to  be 
much  less  important  in  reducing  the  size  of  the  search  in  the  non-depth 
first  search  strategies  tested.  Ue  feel  that  use  of  such  restrictive 
length  and  level  (especially  length)  bounds  invalidates  the  results  of  til) 
to  a degree,  since  their  use  imposes  severe  limitations  on  the  structure  o f 
the  search  space.  In  this  restricted  search  space,  tests  of  different 
strategies  may  fail  to  discriminate  between  strategies,  or  unfairly 
discriminate  between  them. 

Given  the  similarity  of  SL-reso I ut i on  and  the  linear  HES0N  procedure, 
one  would  anticipate  substantial  similarity  in  the  results  for  PSTP  using 
the  bounded  101/100  strategy  (upper  diagonal  search)  and  SLRTP.  For  G of 
the  examples  (BURSTAll,  SHORTBURST,  HAS-PARTS1,  PRIM,  ANCES1,  and  NUni), 
the  results  agree  closely.  Differences  emerge  for  the  remaining  3 
examples.  Ue  don’t  know  why  PSTP  did  so  much  worse  than  SLRTP  on  the  HAS- 
PARTS2  example.  In  the  GRGUP1  example,  use  of  a fully  factored  input  set 
of  clauses  was  clearly  beneficial  to  SLRTP  since  the  very  short  proof  could 
be  shortened  further  by  using  a factoreu  form  of  one  of  the  associative 
axioms.  PSTP  with  factorization  could  not  match  the  SLRTP  results,  since 
extension  by  the  associative  axiom  followed  by  factorization  counted  as  2 
inference  operations  in  computing  the  level  of  the  resulting  chain  (whose 
value  is  used  to  compute  the  order  function  value)  whereas  extension  by  the 
factored  associative  axiom  by  SLRTP  counts  as  only  1 inference  operation. 
(PSTP  could  be  made  to  equal  SLRTP’s  performance  on  this  example  by 
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inputting  a fully  factored  set  of  axioms,  a perfectly  legal  operation, 
although  unnecessary  for  completeness.)  In  the  case  of  the  GR0UP2  example, 
fcr  which  SLRTP  failed  to  find  a solution,  both  the  bounded  and  unbounded 
i0i/100  strategies  in  PSTP  showed  relative  difficulty  in  discovering  a 
solution.  In  SLRTP,  this  difficulty  was  exacerbated  by  the  very  feature 
wh.ch  aided  the  quick  solution  of  the  GR0UP1  example:  mandatory 

factor izat ion.  PSTP  with  factorization  and  the  STRONG-DELETE 
postprocessing  operation  (resulting  in  an  inference  system  very  similar  to 
Sl-t eso i ut i cn)  failed  to  find  a proof  with  the  unbounded  101/100  strategy 
after  1507  chains  were  generated,  discovered  a proof  while  generating  410 
chains  with  the  unbounded  501/100  strategy,  and  discovered  a proof  while 
generating  458  chains  with  each  of  the  bounded  strategies.  The  proofs 
discovered  were  the  same  as  those  discovered  without  factorization. 

He  believe  the  detrimental  effects  of  factorization  as  demonstrated  in 
the  GRQUP2  example  results  are  more  typical  than  the  beneficial  effects 
illustrated  in  the  GROUPi  example.  In  our  experience,  even  in  cases  where 
factorization  does  shorten  a proof  (as  it  did  not  in  the  GR0UP2  example), 
the  proliferation  of  highly  instantiated  chains  caused  by  the  use  of 
factorization  still  often  outweighs  the  benefits.  (These  negative  comments 
Clearly  refer  only  to  general  factorization  where  literals  must  be  unified; 
factorization  in  the  ground  case  is  clearly  beneficial  and  is  included  in 
the  postprocessing  operations  we  used  here.)  Should  future  experience 
prove  this  judgment  about  factorization  wrong,  the  linear  MESON  procedure 
still  permits  factorization  as  a legal  though  optional  operation. 

Another  point  can  be  made  here  concerning  SLRTP’s  efforts  to  discover 
a solution  to  the  GR0UP2  example.  SLRTP  uses  a literal  selection  function 
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to  designate  which  literal  of  each  derived  chain  is  to  be  used  in  future 
extension  operations.  The  only  literal  selection  function  tested  was  the 
function  which  aluays  selects  a literal  which  has  the  fewest  matching 
literals  among  the  axioms.  This  has  the  obviously  desirable  characteristic 
of  reducing  the  branching  rate  of  the  search  tree  since  the  selected 
literal  has  the  fewest  matches  among  the  axioms  and,  further,  removal  of 
the  selected  literal  after  some  in,erence  operations  will  usually 
instantiate  the  remaining  literals  and  reduce  the  number  of  literals  among 
the  axioms  matching  them.  However,  this  literal  selection  function  is,  in 
the  case  of  problems  with  structure  similar  to  the  GROl  P2  example, 
inconsistent  with  the  use  of  length  maxima.  !n  GR0UP2,  for  example,  the 
literal  selection  function  will  show  a preference  for  literals  capable  of 
being  extended  upon  only  by  the  associative  axioms  (since  any  positive 
literal  matches  the  consequent  of  the  associative  axioms,  any  literal 
matching  a unit  axiom  also  matches  the  associative  axioms).  Thus,  the 
effect  of  the  use  of  this  literal  selection  function  is  to  increase  the 
length  of  chains  appearing  in  a deduction  possibly  requiring  the  increase 
of  the  length  maximum  used. 

One  final  point  remains  about  the  comparison  of  results  between  PSTP 
ard  SLRTP.  This  concerns  the  very  small  number  of  attempted  inference 
operations  by  SLRTP.  This  is  due  to  the  use  of  a literal  classification 
tree  which  automatically  selects  out  likely  matches  for  literals  to  be 
extended  upon  from  among  the  literals  in  the  axioms.  The  extension 
operation  is  only  attempted  for  axioms  containing  literals  selected  by  the 
literal  classification  tree.  This  probably  represents  a fairly  small 
(though  real)  saving  in  computational  effort  since  one  must  count  the  cost 
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Of  creating  and  accessing  the  literal  classification  tree  and  the  cost 

edvnd  3 that  of  attempting  unifications  destined  to  fail,  usually  a fairly 

quick  operation.  The  real  benefit  of  use  of  the  literal  classification 

thee  is  the  elimination  of  the  multiple  attempts  at  unifying  literals  that 

mould  ordinarily  result  from  use  of  the  literal  selection  function 

requiring  discovery  of  the  number  of  matches  for  a literal  among  literals 
in  the  axioms. 

in  comparing  the  four  strategies  tested  by  PSTP  among  themselves,  one 
firs,  discovers  that  the  501/100  strategy  invariably  performed  as  mel,  as 
or  better  (nan  the  101/100  strategy  for  the  same  choice  of  length  and  level 
maxima.  This  is  especially  true  of  me  results  for  the  BURSTALL,  GR0UP2, 
and  PRIM  examples  in  the  absence  of  length  and  level  maxima.  A further 
demonstration  of  the  superiority  of  the  501/100  strategy  is  its  relative 
insensitivity  to  length  and  level  bounds.  Only  in  the  BURSTALL  example  did 
«he  bounded  501/103  strategy  perform  significantly  better  than  the 
unded  strategy.  Also,  in  the  PRIM  and  GR0UP2  examples,  the  addition  of 
J and  level  bounds  actually  degraded  the  performance  of  the  501/100 
strategy  since  the  bounds  excluded  proofs  discovered  by  the  unbounded 
strategy.  In  contrast,  performance  of  the  101/100  strategy  mas  often 
improved  by  the  addition  of  length  and  level  bounds,  bu,  (as  stated  above) 
improving  upon  the  performance  of  the  501/100  strategy.  The 
demonstrated  insensitivity  0f  the  501/100  strategy  to  the  addition  of 
length  and  level  bounds  seems  especially  significant  in  viem  of  the  often 
extreme  restrictiveness  of  the  bounds  tested. 

Due  to  its  general  I j good  performance  and  lack  of  improvement  with  the 

addition  of  bounds,  we  regard  the  unbounded  501/100  strategy  as  the  best 
among  those  tested. 
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Ue  feel  generally  that,  provided  it  performs  adequately,  i complete 
(e.g.,  length  preference)  strategy  like  the  unbounded  501/100  strategy  is 
to  be  preferred  to  an  incomplete  (e.g.,  length  bounded)  strategy  like  the 
bounded  101/100  strategy,  e'  2n  if  the  latter,  uith  appropriate  choice  of 
bounds,  can  often  match  the  performance  of  the  former. 

Finally,  we  merely  note  that  judicious  elimination  of  various 
implicative  forms  of  the  axioms  can  result  in  significantly  improvec 
performance  as  demonstrated  in  the  results  for  the  HAS-PARTS1,  HAS-PARTS2, 
and  PRIM  examples.  Of  course,  this  elimination  of  implicative  forms  of  the. 
axioms  destroys  the  completeness  property  of  the  linear  MESON  procedure. 
However,  this  controlled  incompleteness  may  be  desirable  in  cases  where 
significant  improvement  in  performance  results.  Completeness  could  be 
preserved  and  nearly  the  same  effect  gained  by  presenting  PSTP  with  all  the 
implicative  forms  of  the  axioms,  but  (via  the  order  funct:on  definition) 
giving  PSTP  a strong  preference  for  using  one  instead  of  another. 

One  feature  of  the  linear  HESON  procedure  not  previously  discussed  is 
the  length  of  its  proofs.  It  is  characteristic  of  linear  theorem  proving 
strategies  that  they  require  longer  proofs  than  some  other  strategies. 
This  and  past  studies  11,4]  indicate  that  linear  strategies  can  overcome 
this  increased  proof  length  and  perform  compet ' t i ve I y with  other 
procedures.  The  linear  resolution  strategy  tested  in  QA3.G  was  less 
successful  since  the  special  chain  rejection  criteria  of  variants  of  the 
model  elimination  procedure  were  not  used. 

Uhile  the  length  of  a proof  is  one  measure  of  its  complexity,  we  feel 
that  the  increased  length  of  linear  HESON  procedure  proofs  is  not  a great 
disadvantage  in  terms  of  readability.  The  problem-reduction  method 
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oriented  form  of  such  proofs  often  makes  them  more  comprehensible  than 
ordinary  resolution  proofs  relying  on  converging  lines  of  deduction 
resulting  in  a refutation. 
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4.  Summary 


We  have  presented  the  linear  MESON  procedure,  a procedure  we  feel  to 
be  one  of  the  most  natural  available  systems  of  complete  inference  in  the 
first  order  predicate  calculus  due  to  its  linear  format  and  relationship  to 
goal -subgoal  trees.  It  also  has  advantages  in  the  capability  for  inputting 
multiple  implicative  variants  of  individual  axioms  so  hat  individual 
variants  can  be  differentially  treated  (or  ignored).  Another  advantage  of 
the  linear  MESON  procedure  as  compared  with,  for  example,  SL-reso I ut i on  is 
the  optional  nature  of  the  factorization  operation.  Although  the  point  is 
not  elaborated  upon  here,  it  is  our  observation  (also  made  by  Fleisig  et  al 
(43)  that  factorization  (except  ground  factorization)  is  usually  harmful 
and  results  in  instantiating  chains  too  greatly. 


The  Programmable  Strategy  Theorem  Prover  (PSTP)  is  a theorem  proving 
program  using  the  linear  MESON  procedure  as  its  inference  system. 
Especially  significant  features  of  PSTP  are  the  general  capabilities  for 
specifying  information  to  be  computed  or  retained  about  chains,  for 
specifying  which  chains  are  to  be  retained  or  manipulated  by  a given 
command,  and  for  specifying  the  order  in  which  inference  operations  are  to 
be  performed  in  fully  automatic  searches  for  a proof. 

Ue  have  presented  Me  results  for  PSTP  solutions  of  9 examp  ..s 
previously  tested  in  two  other  theorem  proving  studies.  From  these  results 
we  concluded  that  PSTP  performed  competitively  as  compared  with  the  other 
tested  theorem  provers.  We  feel  that  the  potential  significance  of  PSTP  is 
not  that  it  perform  spectacularly  using  simple  search  strategies  such  as 
those  tested  (it  doesn’t),  but  that  it  provides  an  inference  system  and 
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system  features  which  facilitate  the  user  specification  of  more  complex  and 
effective  search  strategies  and  chain  elimination  criteria. 

He  also  demonstrated  empirically  the  inferiority  of  the  diagonal 
search  strategy  to  similar  strategies  which  have  a stronger  length 
preference  component  and  that  such  length  preference  oriented  strategies 
all  but  eliminate  the  need  for  length  maxima  for  problems  of  this  level  of 
complexity.  He  prefer  search  strategies  that  have,  for  example,  length 
preference  built  in  to  the  added  imposition  of  length  bounds:  the  use  of 
preference  strategies  in  the  absence  of  bounds  results  in  complete 
strategies  guaranteed  to  find  a solution  if  one  exists. 

He  have  also  criticized  some  of  the  methodology  used  in  previous 
theorem  proving  studies.  Results  in  such  studies  are  often  heavily 
dependent  on  the  length  and  level  (especially  length)  bounds  used  in 
restricting  the  search  for  a solution.  In  consequence,  success  in  finding 
a solution  in  reasonable  time  and  space  is  often  more  attributable  to  the 
bounds  used  than  the  tested  stra  egy.  Thus,  such  results  fail  to 
adequately  discriminate  between  different  theorem  proving  procedures.  He 
urge  future  studies  test  theorem  proving  procedures  in  the  absence  of 
artificially  imposed  bounds. 
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Examples 


1.  BURSTALL  Example 
Ax lomsi 

1.  has (pi , ass ( j , n0) ) 

2.  f o I lows(p2,pl) 

3.  has (p2,ass (x  ,nl) ) 

4.  labe  Is  ( loop,p3) 

5.  fo  I lows (p3,p2) 

6.  has (p3 , I f then  (equa I ( j , n)  , p4) ) 

7.  has (p4, go  to  (out) ) 

8.  fol lows  (p5,p4) 

9.  (o I lows (p6,p3> 

10.  ha  (p6,ass <k , t imes <n2,k ) ) ) 

11.  fol lows  (p7,p6) 

12.  has (p7, ass  ( j ,plus ( j ,nl) ) ) 

13 . foil  ows  <p8,  p7) 

14.  has(p8,goto(loop)) 

15.  foil ows (xp, yp)  * succeeds (xp,yp) 

16.  succeeds(xp,zp)  a succeeds (zp,yp)  ■*  succeeds (xp,yp> 

17.  has(xp,goto(zp))  a I abe I s (zp, yp)  w succeeds (yp, xp) 

18.  has  (xp,  i f then(ze,yp) ) ■*  succeeds  (yp,xp> 

Theorem: 


19.  succeeds (p3,p3) 

2.  SHORTBURST  Example 
Axioms: 


1.  I abe  Is  ( loop,p3) 

2.  has (p3 , i f then  (equal ( j ,n) ,p4) ) 

3.  has (p4, go  to  (out) ) 

4.  fol lows(p5,p4) 

5.  fol lows(p8,p3) 

6.  has (p8, goto ( loop) ) 

7.  f o I I ows (xp, yp)  + succeeds(xp,yp) 

8.  succeeds  (xp.zp)  a succeeds (zp,yp)  t succeeds(xp.yp) 

9.  has  (xp,  goto  (zp) ) a I abe  I s (zp,  yp)  -*  succeeds  (yp,  xp) 

10.  has  (xp,  i f then (ze, yp) ) -•  succeeds(yp,xp) 

Theorem: 


11.  succeeds  (p3,p3) 

3.  HAS-PARTS  Example  1 
Ax i oms : 


1.  in(John,boy> 

2.  in(x,boy)  •*  i n <x , human) 

3.  hp(x,xm,y)  in <sk  1 <x,y,z,xm,xn)  ,y)  v hp (x,  t (xm, xn) , z) 

4.  hp(x,xm,y)  ^ -hp (sk 1 (x, y , z, xm, xn) , xn, z)  v hp (x, t <xm,xn) ,z) 

5.  i n ( x , hand ) ■*  hp  (x,n5,  f i ngers) 

6.  in(x, human)  ■*  hp(x,n2,arm) 

7.  in(x,arm)  + hp(x,nl  hand) 

Theorem: 


8.  hp(John, t (n2,nl) ,hand) 
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Ex  amp l at 

4.  HAS-PARTS  Examp  I#  2 
Axioms: 

1.  in(John,boy) 

2.  in(x,boy)  ■*  in(x, human) 

3.  hp(x,xm,y)  •*  in (sk 1 (x,y , z, xm, xn) , y>  v hp  (x , t (xm,xn) , z) 

4.  hptx,xm,y)  ■*  -hp  tsk  1 (x,  y , z , xm,  xn) , xn,  z)  v hp  (x,  l (xm,  xn) , z> 

5.  In(x,hand>  ■*  hp(x,n5,  f ingers) 

6.  in(x, human)  4 hp(x,n2,arm> 

7.  in(x,arm)  ■*  hp (x, nl, hand) 

Theoram: 

8.  hp (John, t (t  (n2,nl) ,n5) , f ingers) 

5.  PRIH  Example 
Ax i oms : 

1 . Dxx 

2.  Oxy  a Oyz  •>  Oxz 

3.  Px  v Dg(x)x 

4.  Px  v Lnly  (x) 

5.  Px  v Lg(x)x 

6.  Lnlx  a Lxa  Pf  (x) 

7.  Lnlx  a Lxa  •*  Of  (x):< 

8.  Lnla 

Theorem: 

<9.  -Px  v -Oxa  negation  of  theorem) 

10.  Pxl  a Oxla 

6.  ANCES1  Example 
Axioms: 

1 . -J  v A v H 

2.  K v H v J 

3.  -K  v H v J 

4.  -A  v -B 

5.  -A  v B 

6.  -H  v -C 

Theorem: 

7.  H a -C 

7.  Num  Example 
Axioms: 

1.  Px  a riyzu  a Dxu  4 Oxy  v Oxz 

2.  Mxxs(x) 

3.  Hxyz  ■»  flyxz 

4.  Hxyz  4 Oxz 

5.  flas(c)s(b) 

6.  Pa 

Theorem: 


7.  Oab 
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Examples 


8.  GR0UP1  Example 
Ax i oms : 

1.  Pxyu  a Pyzv  A Pxvu  Puzu 

2.  Pxyu  a Pyzv  a Puzu  -•  Pxvu 

3.  Pg(xy)xy 

4.  Pxh(xy)y 

5.  Pxy * (xy) 

Theorem: 

<6.  -Pj(x)xj(x)  negation  of  theorem) 
7.  Pj(xl)xlj(xl) 

9.  GR0UP2  Example 
Ox i oms: 

1.  Pxex 

2.  Pexx 

3.  Pxyu  a Pyzv  a Puzu  Pxvu 

4.  Pxyu  a Pyzv  a Pxvu  Puzu 

5.  Pxxe 

6.  Pabc 

Theorem: 


7,  Pbac 


Length  Level  Length  Level  Proot 
flulti-  flu  1 1 i - flex  i mum  flax  i Kuo  Code 


-flllir  . -Ol ier 

1 BURSTALL  Example 

ioi  iaa 

A 

sai  iaa 

_ 

B 

ioi  iaa 

2 

12 

A 

sa:  iaa 

2 

12 

A 

QA3.6  best 

2 

12 

QA3.6  mean  16/13  2 

12 

SLRTP 

3 

13 

SHORTBURST  Examp |, 
101  100 

• 

A 

501  188 

_ 

_ 

A 

101  100 

2 

ia 

A 

501  iaa 

2 

la 

A 

QA3.6  best 

2 

10 

QA3.6  mean  14/14  2 

ia 

SLRTP 

3 

ia 

HAS-PARTS  Example 
Impi icat ive  form 

1 
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axioms: 

ioi  iaa 

- 

A 

501  iaa 

_ 

A 

101  100 

3 

ia 

A 

sei  iaa 

3 

10 

A 

Disjunctive  for* 
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101  100 

_ 

_ 

A 

sai  iaa 

_ 

A 

101  loe 

3 

ia 

A 

sei  iaa 

3 

ia 

A 

QR3.6  best 

2 

la 

QA3.6  moan  6/6 

2 

la 

SLRTP 

2 

la 

fAS-PARTS  Example  2 

Impi icat ive  form 

for  axioms: 

101  100 

— 

A 

501  ieo 

- 

A 

101  100 

3 

10 

A 

501  iaa 

3 

la 

A 
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for  axioms: 

101  180 

_ 

A 

501  iaa 

_ 

B 

101  iaa 

3 

ia 

A 

501  108 

3 

ia 

B 

QA3.6  best 

2 

le 

QA3.6  mean  6/7 

2 

la 

SLRTP 
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7/ 

7/ 
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10/ 
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8/ 
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9/ 
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14/ 
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Proofs 

1.  BURSTflll  Example 
Proo'  fl 

19.  s icceeds  ( p3 , p 3 ) 

20.  tsuLceeds(p3,p3)J  succeeds (p3,xl)  succetds(xl,p3) 

21.  tsucc»eds(p3,p3)J  succeeds(p3,xl)  (succeeds(xl,p3>]  foil ous (xl, p3 ) 

22.  tsucceeds(p3,p3>]  succeeds  <p3,p6) 

23.  [succeeds(p3,p3)J  (succeeds(p3,p6)l  succeeds (p3,xl)  succeeds (xl,p6) 

2*.  :sL-'''-9eds<p3,p3)]  [succeeds  (p3,p6)l  succeeds(p3,xl)  tsucceeds(xl.p6)] 

fol  lous(xl,p6) 

25.  [succeeds  (p3,p3)l  (succeeds (p3, p6) ) succeeds (p3,p7) 

26.  (succeeds(p3,p3)]  (succeeds(p3,p6)]  (succeed-  (p3,p7)l  succeeds  (p3,xl) 
succeeds (xl,p7) 

27.  [succeeds(p3,p3)]  [succeeds(p3,p6)l  tsucceeds(p3,p7)J  succeeds(p3,xl) 
[succeeds (xl,p7)]  fol lows(xl,p7) 

28.  [succeeds  (p3,p3)J  tsuccieds<p3,p6>I  (succeeds(p3,p7)]  succeeds (p3,p8) 

29.  [succeeds  (p3,p3)l  (succeeds(p3,p6)l  tsucceeds(p3,p7) J [succeeds(p3,p8)] 
has (p8, go  to (xl I ) labe Is (xl,p3) 

30.  Esucceeds(p3,p3)]  (succeeds(p3,p6)]  (succeeds (p3,p7)]  [succeeds(p3,p8)l 
has (p8, goto ( loop) ) 

31.  empty 

Proof  B 
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extend  by  16 

extend  by  16 

extend  by  15 
extend  by  9 

extend  by  15 
extend  by  11 
extend  by  16 

extend  by  15 
extend  by  13 

extend  by  17 
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extend  by  14 
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theorem 
extend  by  8 
extend  by  7 
extend  by  5 
extend  by  9 
sxtend  by  1 
extend  by  6 


11.  succeeds (p3,p3) 

12.  Isucceeds(p3,p3)}  succeeds (p3,xl)  succeeds(xl,p3) 

13.  (succeeds(p3,p3)J  succeeds <p3,xl)  [succeeds (xl,p3>)  fo  I lows  (xl,p3> 

14.  (succeeds (p3,p3)l  succeeds(p3,p8) 

15.  (s‘jcceeds(p3,p3))  tsucceeds(p3,p8) J has(p8,goto(xl> ) labels  (xl,p3> 

16.  (succeeds  (p3,p3)J  [succeeds (p3,p8)l  has(p8,goto(loop)) 

17.  empty 


19.  succeeds(p3,p3) 

20.  (succeeds(p3,p3)l  succeeds  (pj,xl)  succeeds (xl,p3) 

21.  (succeeds(p3,p3)l  succeeds(p3,xl)  [succeeds(xl,p3)J  succeeds  (xl,  x2) 
succeeds (x2,p3) 

22.  (succeeds (p3, p3) ] succeods(p3,xl)  tsucceeds(xl,p3>)  succeeds (xl,x2> 
[succeeds (x2,p3)l  fo  I lows <x2, p3) 

23.  [succeeds (P3,p3)]  succeeds <p3, xl)  (succeeds (xl,p3) ] succeeds (xl,p6) 

24.  [succeeds (p3,p3)J  succeeds <p3, xl)  (succeeds(xl,p3)J  [succeeds(xl.p6>] 
fol lous(xl,p6) 

25.  (succeeds (p3,p3)J  succeeds (p3,p7) 

26.  [succeeds (p3,p3)]  [succeeds (p3, p7)J  succeeds (p3,xi)  succeeds (xl,p7> 

27.  (succeeds (p3, p3) ) [succeeds <p3, p7) J succeedstp3,xl)  [succeeds (xl,p7) ] 
fol lous(xl,p7) 

28.  [succeeds (p3,p3)]  (succeeds <p3,p7)l  succeeds (p3,pB) 

29.  [succeeds  (p3, p3) ] [succeeds <p3,p7)  J r^i>cceeds(p3,pB))  has (pB.qoto (xl) ) 
labe  I s (xl ,p3) 

30.  (succeeds(p3,p3)]  (succeeds(p3,p7)J  [succeeds  (p3,p8) ) has(p8,goto(loop)) 

31.  empty 

2.  SHORTBURST  Example 
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extend  by  15 
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Proof* 


3.  HAS-PARTS  Example  1 
Proof  A 


8. 

9. 

10. 

11. 

12. 

13. 

14. 


hp  (John , t (n2 , nl) .hand) 

Ihp  (John,  t (n2, ril ) .hand)]  hp(John,n2,xl)  hp(sk  1 (John, xl, hand, n2,nl)  ,nl, hand) 
(hpfJohn, t (n2,nl) .hand)]  hptJohn, n2,xi) 

.hp  (sk 1 ( John, xl, hand, n2,nl) ,ni,hand)]  in  (ski (John, xl, hand, n2,nl)  ,arm) 

(hp  (John,  t (n2 , nl ) , hand) ] hp (John, n2,  arm) 

(hp (John, t (n2 , nl ), hand) ] [hp(John,n2,arm)]  in  (John, human) 

(hp (John,  t <n2,nl)  .hand)]  thp (John, n2, arm)]  tin(John,hi  in]]  in(John,bou] 
empty 


theorem 
extend  by  4a 

extend  by  7 
extend  by  3b 
extend  by  6 
extend  by  2 
extend  by  1 


4.  HAS-PARTS  Example  2 
Proof  A 


o.  hp  (John,  t (t  <n2,nl ) , nS ) , f ingers) 

9.  (hp  John, t (t <n2,nl) ,n5) , f ingers)]  hptJohn, t (n2,ni>  ,xl) 
hp  (sk  1 ( John,  xl,  f ingers, t <n2,nl)  ,n5)  ,n5,  f ingers) 

10.  (hp (John,  t (t  (n2,nl) ,n5) , f ingers)]  hp ( John, t <n2,nl) ,xl) 

(hp  (sk  l(John,xl,  fingers,  t (n2,nl ) ,n5)  ,n5,  fingers)] 

in  (sk 1 (John, xl , f inqers, t <n2,nl) ,n5) .hand) 

11.  (hp (John, t (t (n2,nl) ,n5) , f ingers)]  hp( John,  t (n2,nl) .hand) 

12.  (hp(John,t(t(n2,nl),n5), fingers)]  (hpfJohn,  t (n2,ni) .hand)]  hp  (John,n2, xl) 
hp (ski ( John, xl, hand, n2,nl),nl,  hand) 

13.  lhp(John,t(t(n2,nl),n5), fingers)]  (hp  (John,  t (n2,nl) .hand)]  hpfJohn, n2,xl) 
thp  (sk 1< John, xl, hand,  n2,nl),nl, hand)]  in  (ski  (John, xl, hand, n2,nl) , arm) 

14.  (hp ( John, t ( t (n2, nl) ,n5) , fingers)]  (hpfJohn,  t (n2,nl) .hand)]  hpfJohn, n2, arm) 

15.  (hpfJohn,  t(t(n2,nl),n5),fmg  s)]  tnpf  John,  t (n2,nl)  .hand)] 

(hpfJohn, n2,ar  i)  ] in  (John, human) 

IS.  (hptJohn, t (t <n2,nl) ,n5) , f ingers)]  (hp( John,  t (n2,nl) .hand)] 

(hp  (John,  n2,  arm)  ] ( i n (John,  human)]  m(John,boy) 

17.  empty 
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extend  by  7 
extend  by  3b 

extend  by  6 

extend  by  2 
extend  by  1 


Proof  B 


8.  hp(John,t(t(n2,nl),n5),fi ngors ) 

9.  (hp (John, t (t  (n^.nl)  ,n5) , f ingers)]  hptJohn, t (n2,ni) ,xl) 

- i n (sk 1 ( John, xl, fingers,  t <n2,n’ / ,n5)  ,xl) 

10.  (hp (John,  t (t  (n2,nl)  ,n5) , f ingers)]  hp (John, t (n2,nl) .hand) 

C" in  (ski  (John,  hand,  fingers,  t <n2,nl)  ,n5) , hand)] 

"hp <s* 1 (John, hand, fingers,! (n2,nl)  ,n5)  ,n5,  f ingers) 

11.  (hp ( John, t ( t (r2,nl) ,n5) , f ingers)]  hp(John,  t <n2,nl)  .hand) 

12.  (hpljohn, t(t(n2, nil, n5> , fingers)]  (hptJohn,  t (n2,nl) .hand)]  hp(John,n2,xl) 
hp(skl (John, xl, hand, n2,nl),nl, hand) 

13.  (hptJohn, t(t (n2,nl),n5), fingers)]  (hptJohn,  t (n2,ni) .hand))  hp(John,n2,xl) 
(hp (sk 1< John, xl, hand, n2,nl),nl, hand) 3 in  (ski  (John, xl, hand, n2,nl) , arm) 

14.  (hptJohn, t(t(n2,nl),n5), fingers)]  (hp  (John,  t (n2,nl] .hand))  hp(John,n2,arm) 

15.  (hptJohn, t (t  (n2,nl)  ,nr) , fingers)]  (hptJohn,  t (n2,ni) .hand)) 

(hp ( John ,n2, arm) ] in  (John, human) 

16.  (hptJohn, t(t(n2,nl),n5), fingers))  (hptJohn,  t (n2,nl) , hand)] 

(hptJohn, n2, arm)]  tin (John, human)]  in(John,boy) 

17.  empty 
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extend  by  4a 
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5.  PRin  Example 
Proof  A 


18. 

11. 

12. 

13. 

14. 

15. 

16. 

17. 

18. 

19. 

20. 
21. 
22. 

23. 

24. 

25. 

26. 

27. 

28. 


Pxl 

Pxl 

Pxl 

Pxl 

Pxl 


Oxla 
(Ox lal 
(Oxlal 
(Oxlal 
(Oxlal 
Pf  (g(a) ) 
Pf(gfa)) 

Pf  (g(a) ) 

P f (g  (a) ) 

Pf  (g  (a)  > 

Pf (g (a) ) 

Pf (g  (a) > 
(Pf (g (a) >1 
(Pf  (g(a))l 
(Pf  (g  (a) ) 1 
(Pf  (g(a))l 
(Pf  (g  (a)  > 1 
(Pf  (g (a) ) 1 
empty 


0x1x2  0x2a 
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Oxlg(a)  (Dg(a)al 
Oxlg (a) 
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(Of  (<|(a))al 
(Df(j(a))al 
(Of  (g(al)al 
(Of  (g (a)  > al 
(Of  (g(a))al 


-Pa 

(-Pal 


Oaa 


(Of (g (a) )g (all  Lnlg(a)  Lg (ala 
(Of (g(a))g(a)l  Lnlg(a)  (Lg (ala)  -Pa 
(Of  (g (a) ) g (a ) 1 Lnlg(a)  (Lg(a)al  i-Pa) 
(Of  <g(a))g(a)l  Lnlg(a) 

(Of  (g(a))g(a)l  (Lnlg(a)l  -Pa 
(Of  (g(a))g(a)l  (Lr.lg(a)]  (-Pa)  Oaa 


Oaa 


Lnlg(a) 

Lnlg (a) 

Lnlg (a) 

Lnlg (a) 

(Lnlg (all  -Pa 
(Lnlg(a)l  (-Pal 


Lg(a)a 
(Lg(a)al  -Pa 
(Lg(a)al  (-Pal 


Oaa 


Oaa 
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18.  Pxl  Oxla 

11.  Pa 

12.  (Pal  -Lnlg (a) 

13.  (Pal  (-Lnlg (all 

14.  (Pal  (-Lnlg (a) 1 

15.  (Pal  l -Lnlg (all 

16.  (Pal  (-Lnlg (all 

17.  (Pal  (-Lnlg (all 

18.  empty 


Lg  (ala  -PMg(a)) 

Lg(a)a  (-Pf  (g(a) ))  Of(g(a))a 

Lg(a)a  t-Pf(g(a))l  (Of(g(a))al  Df(g(a))xl  Oxla 

Lg(a)a  (-Pf(g(a)ll  (Of(g(a)lal  Of  (g  (a) ) g(a) 

Lg  (a)  a 


Proof  C 


10.  Pxl  Oxla 

11.  Pa 

12.  (Pal  -Dg(a)a 

13.  (Pal  (-Og(a)al  Oxlg  (a ) -Oxla 

14.  (Pa)  (-Og(a)a)  Oxlg(a)  (-Oxlal  Pxl 

15.  (Pa)  (-Og(a)a)  Of  (xl)g(a)  (-Of(xl)a)  lP»(xl>]  Lnlxl  Lxli 

16.  (Pal  (-Og(a)al  Of(g(a))g(a>  (-Of(g(a)la)  (Pf(g(a)l)  Ln'g(a) 

17.  (Pal  (-Og(a)al  Of(g(a))g(a) 

18.  (Pal  (-Og(a)al  (Of  (g(a))g(a)l  Lnlg(a)  Lg ( a) a 

19.  (Pal  (-Og(a)al  (Of  (g(a>)g(a)l  Lnlg(a) 

20.  empty 


6.  ANCES1  Example 


Proof 

A 

7. 

H - 

C 

8. 

H 

9. 

(Ml 

J -A 

10. 

(HI 

J (-A)  B 

11. 

(HI 

J 

12. 

(HI 

(J)  -K 

13. 

empty 

theorem 
extend  by  2 
extend  by  3a 
extend  by  9b 
extend  by  1 
extend  by  7 
extend  by  5a 
extend  by  9b 
extend  by  1 
extend  by  4a 
extend  by  9b 
extend  by  1 
extend  by  6 
extend  by  5a 
extend  by  9b 
extend  by  1 
extend  by  4a 
extend  by  9b 
extend  by  1 


theorem 
extend  by  1 
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extend  by  2b 
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Proof* 


7.  NUni  Example 
Proof  0 

7.  Dab 

8.  IDabl  Pa  hxlbx2  0ax2  -Oaxl 

9.  [Dab]  Pa  (Ibbxl  Oaxl 

10.  [Dab]  Pa  rtbbxl  [Oaxl]  0ax2xl 

11.  [Dab]  Pa  flbbslb) 

12.  [Dab]  Pa 

13.  empty 

8.  GR0UP1  Example 
Proof  0 

7.  Pj  (xl)xlj  <xl) 

8.  CP j (xl)xlj <xl)]  Px2x3 j <xl)  Px3xlx4  Px2x4 j (xl# 

9.  IP j (xl)xlj  (xl)]  Pg  <x2j  < x 1 > )x3j  <xl)  Px3xlx2 

18.  [Pj  (h(xlx2])h<xlx2)  j <h<xlx2))]  Pg  (x2  j (h(xlx2) ) )xl  j (h  <xlx2> ) 
11.  empty 

9.  CR0UP2  Example 
Proo'  A 

7.  Pbac 

8.  [Pbac]  Pbxlx2  Pxlx3a  Px2x3c 

9.  [Pbac]  Pbxle  Pxlca 

10.  [Pbac]  Pbxle  [Pxlca]  Px2x3xl  Px3cx4  Px2x4a 

11.  [Fbac]  Pbxle  [Pxlca]  Pax2xl  Px2ce 

12.  [Pbac]  Pbxle  [Pxlca]  Pacxl 

13.  [Pbac]  Pbxle  [Pxlca]  [Pacxl]  Pax2x3  Px2x4c  Px3x4xl 

14.  iPbacJ  Pbxle  [Pxlca]  [Pacxl]  Pax2#  Px2xlc 

15.  [Pbac]  Pbbe  [Pbca]  [Pacb]  Paae 

16.  [Pbac]  Pbbe 

17.  empty 

Proof  B 

7.  Pbac 

8.  [Pbac]  Pbxlx2  Pxlx3a  Px2>3c 

9.  [Pbac]  Pbxla  Pxlba 

10.  [Pbac)  Pbxla  [Pxlba]  Px2x3xl  Px3bx4  Px2x4a 

11.  [Pbac]  Pbxla  [Pxlba]  Pax2xl  Px2b» 

12.  [Pbac]  Pbxla  [Pxlba]  Pabxl 

13.  [Pbac]  Pbca 

14.  [Pbac]  [Pbca]  Pxlx2b  Px2cx3  Pxlx3a 

15.  [Pbac]  [Pbca]  Paxlb  Pxlce 

16.  [Pbac]  [Pbca]  Pacb 

17.  [Pbac]  [Pbca]  [Pacb]  Paxlx2  Pxlx3c  Px2x3b 

18.  [Pbac]  [Pbca]  [Pacb]  Paxle  Pxlbc 

19.  [Pbac]  [Pbca]  [Pacb]  Paae 

20.  empty 


theorem 
extend  by  la 
’■educe 
extend  by  4 
extend  by  5 
extend  by  2 
extend  by  6 


theorem 
extend  by  1 
extend  by  3 
extend  by  4 
extend  by  3 


theorem 
extend  by  3 
extend  by  2 
extend  by  4 
extend  by  1 
extend  by  5 
extend  by  3 
extend  by  2 
extend  by  6 
extend  by  5 
extend  by  5 


theorem 
extend  by  3 
extend  by  6 
extend  by  4 
extend  by  1 
extend  by  5 
extend  by  6 
extend  by  4 
extend  by  1 
extend  by  5 
extend  by  3 
extend  by  2 
extend  by  6 
extend  by  5 
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